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(54) GENE FOR CELLULOSE-SYNTHESIZING ENZYME 

(57)Abstract 

PROBLEM TO BE SOLVED: To obtain the subject new gene 
comprising a DNA encoding a protein containing a specific amino 
acid sequence and having a cellulose-synthesizing enzyme activity, 
and useful for being inserted into prokaryotic cells or eukaryotic 
cells to control the production of cellulose. 
SOLUTION: This gene for an enzyme for synthesizing cellulose 
comprises a new DNA which encodes a protein comprising an amino 
acid sequence of formula I, formula II, etc., or an amino acid 
sequence of formula I, formula II, etc., in which one to several amino 
acids are deleted, replaced, inserted or added, and having the 
activity of a cellulose-synthesizing enzyme, and which encodes the 
cellulose-synthesizing enzyme originated from cotton (Gossypium 
hirsutum), and is useful for being inserted into prokaryotic cells or 
eukaryotic cells to control the production of cellulose. The DNA is 
obtained by separating a mRNA from the fiber cell of the cotton, 
making a cDNA library from the mRNA, screening the library with a 
probe having a sequence determined from a data base, and 
subsequently recovering the DNA from the positive clone. 
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(54) [2£^<o£fl;] irJPn-x^fife^^M^^ 
(57) l^m 

— x <z> £ g * m w *r * *6 1 c # ^ > 5 n & -t ;u a - x ^ 

WTtf) (A) - (C) <D\,*-ftlfr<D? > 

(A) IB*IJ#^ 2 1:^1-7 
^y&ffi?ik XteK^J#^ 2 K:fcUT 1 ^L< te&f@cD 

7i*y&a**^ if a&i, <\*ttmz titer si y 

^^W^S. (B) KJiJ#^4 tC^TT^ y^EJij, 

xiiK^j#^4 ic*n>r 1 :£L< cor ^ y&a**: 
3c, Sift, if A^b< li^^^n^r a y Rs^^fe 

n. (o sb^j#^8 i:^t7 ^ y s^e^j, xiiS2^j# 

^8K:fcV>T l^L<«M075/i^^, (lift, 

1 K^-rr a y 3usefj#^i Mc^^ri^ 

Kii»i075/si«^ sift. If a;£L< tettira 



PosA3 



PoM3-5' 



PC8A3-3" 



PmA3-682 



IIf#^l] £4T(D (A) - (C) CDU i rn^(D^> 
<A) SB^i#^2 (C^Tr 5 y^gB^iJ. Xteffi?iJ##2 

i^H^titKntiors/i?^^, aft, jf A 

(B) ffi^J#§4l:^7$ytE3fiK XteE*3J#^4 
l^^tlf Kli^i(D75/S?^^ t gft > SA 

ffi?U#^l 1 l;it7 ^ XtefiB?!)#^l lie 

^^Ti^i<n»:i07ay^^^, 

[Hf#^3] 1 IB ^ COD N A Tjf^ff 

SB2^J^Wr^RNA^^^^it^C<h^J:0, ffigBin 
[ 0 0 0 1 ] 

t^^WIt^M^^J *B9Itt. V? (Gossypinm 
birsntnn) Etj^cDir^D-x^i^*:}- Kf^DN 
A, Zft&<$tS&m?LDN A. mf!2DN ATWi$ 

[0 0 0 2] 

^tttrS, g&,S<!b LT(Df X^;p 
-i^^m (P. oryzae) CD £ e> , -feJPn — X££f£<z> 
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^t>JfcfflTt!r*. iS^. ?*li*£*ffl]i&&*<DcDNAsCD^ 

>-tr >X^fTV>, ^£&Jir^D-x^jj£&ifcit 
^(bacterial BcsA) £ cDfg|^tetf> £ , 5c±m&Ce\M 
£*5Z±m&Ce\At&<? *±)l>n-X<&fo#m c DNA 
Tfc*^&tt**$8££*i*: (Pearet ah, Proceeding o 
f National Acadeiy of Science, USA (1996) 93, 1263 
7-1 2642). CelAHcoHTWU UDP-gl ncose t 
TKZtlfrtf. CelA2t^UTJi:^(cC^^r 5 S mM&l 

[ 0 0 0 3] 
[0 0 0 4] 

^ ^fttftffiWfc c DN A ££*j5£L, c DNA7< ^ 

&2 U-XDmm&&lU>£^\Zttl,X 6 ^©75 y 
-X 4 - ^ >;u h ^>x^oi^-ii (be s 

a) mfc^zysL ymmw\zm€&x_rct><Dt. ttm& 

JO s A 1 , P c s A 2 , RZtP c s A 3 £ffr*Lfc (PcsA 
It* Plant Cellulose Synthase AOI§T&£) . 
[0 0 0 5] ?Zsit>t>*&mte* £KT<D (A) - (C) 

(A) E«f 2l:,Tt7 5 y®?i2^J. XteE3RJ#^2 

«c^^Ti^L<«^<@<^r^y^^>:^, ©ft, jfA 

(b) ffi^j#^4^^-rT5 ys?ffi^»j> x«S2jij#^4 

(C) ffi^J#^8(c^-rr^y^S2^j, XfifiB^J#^8 

^i<«w$nfc7^ ygE?y, s^j#^i iic 
^-rr^y^ie^. x«e^j#^i iic^^tiei< 

[0 0 0 6] *fc*%gg« % JtBaDNAO^ffiSXte-gB 
iO ^r^trlfift^^^^-, Rt/iiaDNA-C^MIEftSn 



3 

[0007] *x9h*&&k:. ^g2DN A&mwzmx 

[0 0 0 8] a, E#J#^ 1 \tP c s A 1 ©, E^J#^ 
3«Pc s A 2 IC IS MS-r*. £3*J## 5 \Z 

P c s A 3 <D 3 ' E^J#^ 7ttPc s A 3 CD 

5 ' <B!J®^, ffi^J#^ 9liPc s A 3 <Dft&m*&(D&Fl 

[0 0 0 9] ±IBD NAOp^, PcsAl^Pcs 
A2\Z, ^^t^D-X^^t^^-Ht-SDNAT 

mmm ^ k & » * -7 * -t ;u d - x ^ m m& & *> ia m 

*T£ C *h^^$nfco S 6 C P c s A 1 RZSP c s A 

2iift5p c sA3^^^i:i^ai5tifc. 

^^c. COj^^TPc s Al&i^Pc s A2CHIT 
[0010] Pc s A3l:||LTfi, P3D P c s 

a z<Dtf)\,-7\zm^-z> tm.t>nz>w\<o2 a-><o 5 ; 

[0011] PcsAl, Pc sA2ttt$6l;^ 

— >03' gP^cOffi^lJi; D y^-f V — Sr^ir U P C R £ 
fT&^, ORF$:ttJ^P->^§Ac e RACE&Srff 
^^^{Cffll^^f^Ji, mRNAcfcO^^l/cc DN 

[0 0 12] >^A ^-^>XCD&£L -fe^D- 

X^fifc&iliiJ&frtX* 1 5 p — ><DftT* 

felZftftLfzOlZ P c s A 2 CD 7 ^ p — >T&o d 

[0013] 

[00 14] <l>7^cDNA7^y7'J-CO^|! 
r7^cDNA7^f7*7 'J-£fe/£-f *£#CDmRN A£ 
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^tR«inB^#£ u^- mRN Aommmizftizmm* 

&<> ji«CD^[#j^6cDmRNACD|fiHJ^Sr»ffl-r^c: 
<h^T^r^o cDNA^It Wz-l*. mRNA*S 
fC#£-f y AX^ U*^ KfCtB 'J Tffi3*J£ 

m&^mmiz&*)ftmDNA*:<&f& 

oTf£b*§£. ^O^&te, #J;U;£Nolecolar Cloning 
(Maniat is Cold Spring Harbour Laboratory) KIBifc 

[00 15] <2>c DN ACD^SK^JCD^^ 
Hbtlt cDNA7-f^7'J-^b7 >^AfC^ D — > 

20 -5. ^^SB^J<D^^{i. Maxan-Gilberl j^^^^ia^-f 
Tt^y&l:J;oTff7^<h^Tf cn^©7^ 

[0 0 16] ^-f 7^^^->^{c J;^^^iB^i<D^^^fT 

[0 0 17] <3>ifife^^-^^-X<h<D^qEU>?- 

^^nfclcDNA^D-XQl^jJ, jf^^^ 

- 9 — X IC $ tlT -5 K^l CDir ;i/ Q -x 

— X 4-j3-y;^y^h7>X7x7— if (BcsA) 
40 16^3-Ht*»*^*tfsn5 (Wong.H.C. et a 

I . , Proc. Nat I . Acad. Sci. U.S.A. 87, 81 30-81 34 (1 99 
0), ACCESSION No. M37202) „ 
[0 0 18] f-^^-^tltlt «D^77tX 

^^f^e^^^tlXV^GenBank, EMBL, DDBJ^^&ftj 
ffl-TSut^t^, ^ : 6Dy-^^ffiC0 7'D^7AtL 
Tte, meScDDN Ag^rV 7 K DNASIS (BiV 7 
X7l>y^7 U >£0 , GENETYI (SDCV7h^i70l 

50 myL&te¥mftffi<D±Mmn&£&mt: $^Lt 
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<omm*. <i>*-*yb±T. NCBHNatiooalCenter 
for Biotechnology Infornation)'\J£|£LT (http://w 
ww.ncbi.nla.nih.gov/BLAST/) BLASKBasic Local Alig 
naent Search Tool) £ £#Jffl L ^ p 

Dayhoff, Atlas of Protein Sequence and Structore, 
vol. 5 (1978)^#M) l:«oT*tDy-X37$gH 

^BLAST©5*blasfx£«JBT* &3t(D 
#«**tt^*fl=r4. DDBJ News Letter No. 15 (Februa 
ry l 995)*#M"Ttl«<fc^. 

[0 0 2 0] <4>7mo-7^»icDNA^ 

^»>M*am5. Molecn 
lar Cloning fg 2 Kg (Maniat is 6. Cold Spring Harbonr 
Laboratory) 12. 30-12. 40£#HLTrr A tf«fcv>. £ 
H^tlfccDNA^5' #J£;fc n T ^ |c 12, 

fOcDNAffi^|$:5' *^$J^#:£T 1* £ £ 5 lc :/ ^ 
-rv-££j£U mRNA^^il: RT-PCR^ff^ 

UDP-g!«cose^^ K*-f ^Tl^75 y _ 

^JGInXXXXXXArgTrp (B2^J#^ 1 2) £n_^ LT; fc 

R^c«E^^i$n^7$/^j^fi^ s ^j 

S^l-1 0 {C^T. S2*J#lfl, 3«P C sAl&y 
PcsA2©iflSS5!l$^/r^L t fiW2, 4teP 
c sAlMPc sA2©ttIfi^^fS$ni7$ 



[0 0 2 2] ffi^J#^5RO:SB^J#^6 fi, Pc 

SA303' mm®Z^t!*U-> ( Pc$A 3- 682 ) 

^-T. Se^J#^7Rt;E^i#^8 PcsA3CD5- 
• ■ttfc^tfMOJrD-^S* (PcsA3-5') com 

3' (PcsA3-3') O^SgB^iR^CO^SK^J^^ 

SB^J#^-5 liPc s A 3 (D 3 ' MM®. S2£iJ# 
^7l*PcsA3(D5' mm®, EWf9ttPc s A 
3©rt***fc»3-r*. PcsA3-682 <hPcsA3-3'<D*-- 

KK^JT 1 y^gl^-ptt^, PcsA3-682<hPcsA3- 
ro<KSE^(Dii:tt« % BI3XrX4(c:^T. PcsA 
3-682<tPcsA3-3 , ^3-H-r^T^ / * L 

£!*J#^1 *3**ICCInXXXXXXArgTr 
P (S«§12) ©Eflltt, PcsAI(-§ibT«. K^iJ# 
^2 K*5U-<5T^ y^#^710^e»7!4-C^O, PcsA2K 
PSLTtt, ffi^#^4{C*l*S75ygg#^778*>£782 
T&O. PcsA3(CliLTtt, EJiJ#*f 6 tc*5 tf* 7 ^ 7 
&#-^356j$> £360-?&£ o 

[ 0 0 2 3] P c s A Hi, Pear et at. (Proceeding o 
f National Acadeny of Science, USA (1996) 93, 1263 
7-12642) €>a*$8£rL*:CeIAI £ , i£gE^Jtf<28ifi£Sfc 

&<t-?Z>rci& (Yaaanolo and Hakoaori, The Journal o 
f Biological Cheiistry (1990) 265, 19257-19262) % 

ce i ai )\, d - x ^ m it * w-r -5 * >/t ^ H £ 

3-Hf5^g^OTit*§. CelAKOn— Hf 
^75 y«K^JIC*5^T48#B©Arg, 56#SOSer, 81 
#@OAsik 104#@(DAIa, 110#aoSer, 247#gOAs 
P, 376#B<DAsp, 386#gOSer, 409#BOArg, 649# 
@OSerte, Pc sAl-CiiftlftlGIn, lie, Ser, Th 
r. Pro, Asn. Gin, Pro, His, GlyT&-5 e 

[0 0 2 4] *^iOPc SA2I2, Pear et a 

i'&&&OfrCe\A2£m-V>mm*:<$/u~V\**2>&. CelA2 
«T^±^"Cfe0, 3-HI«iM^T^^. Ce 
IA2«, E^J#^ 3 *C^-T P c s A 2 CO^^E^J(C*5^ 
T, ifi£#^I083~33!1l;:;f§3-r*. SB^J#^ 2 , 4, 

8, i o&tfi i izTikTT* ymmmt^-rn^^ 

[0 0 2 5] Sfc, ±I2T^ /m&FKDo*. *&W<D 

&te?<»ft&tz%nmiz&w£i*^te^ mo% y^ ym 

«MAD«, E^j##2, 4, 6, 8, 1 O&tfl ! (c^ 
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-T*. -fe^D-X^f&PiiiiStehJ:. #J;U*, T. Hayasb 
i: MeasoriDg-0-glacaD deposition in plant cell va 
lis, 1d Modern Methods of Plant Analys is : Plant Fib 
ers, eds. HF. Linskens and JF. Jackson, Springer-V 
erlag, 10: 138-I60 (I 989) tC I2« £ tlT t> £ \Z <fc 

[0 0 2 6] r 7^(D«55S«CJ:oX, 
3, 5, 7 Xte9 \Z7ji-?mm&?\\<Do*>. Zi-Fm®.<D 

'J>^x>h^^^j til, ^b^^nsw^A-f yu 
^•ffftir^. z<D&ft&wmiz&mik'rz> z t^mm 

-rxb, f net o ffi^j4^ti^^R±^/\-f y ij y-r 

[ 0 0 2 7 3 < 5 >*5£W<Dmfc^F<D#\m 
^ "feX (SaccharoByces)Jg^[C)||-r^>^S, «7^^CD<g 

b, c com^ ^-^^a^tc^A-r ^> c: i jCcfc -p 

M~V\t* Invi IrogenttSStf)^ >rt&5&Mm3ry hPichia 

Expression KitRtf Z<D3r y h JC^5 £*1T ^ 
— pPIC9£J?H>5 C ttfTZZ. &%m<Dtg&ffl 

[ 0 0 2 8 I 

[0 0 2 9] <l>9^^6(DiRNAOSISS 
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<h bT *7 ^ (Gossypiua hirsutua L.) Coker312£& 
Oit^te^b, 375ag(DDTT£&;fccD£g;JraA*:&* 90 

^9sx:\zjmmvftummt& (30bm edtarcx i%sds£ 

^t*0. 2M*£&:f- h U ^A£pH9. 0 IZ Gt>i* fz&. v?X 

io \zt®^z>) ^20da\m^x<mwvrz. 

[0030] cntc;/n^y-HfK£iooQgjta*p*tf&# 

b, 40tT2^^^ >*^- h bfcgH6altf) 2M KCI 
MtKck<»#b£:&, l»*!SJ#eL, 
*a»#»'fc«-C20#H (4t) 1 2.000 g-C**#ttl 
±t3£ifciabTi¥>B&£[$:£b, ^X->U>y-iC 
^bT^MSrfrD, Cin^SiJcD^^^tC^b, ftiiJ^S 
1 Bl^feOSSngO^ffcU^^A^Jta^T^iNEieS 2MJC 
bfe. C©^$4tt-^^Ilfc^ jfcl&bfcRN 
A£20# 12,000 gTS^iLfe. #^tlfcRN.A(D 
20 fctJ&ti, J^-^bfc 2M£fcU 7*7 AT? 2 0^^«t«^-& 

[ 0 0 3 1 ] *§£*X*:RN A^, lOmM h U X^®?^ (pH 
7.5) X^2ng/BltC^^ e tp{3^^b, SUmmtf^^A 
&Z00mH\ZfcZ> J: z> mXtz^k. x 9 J — )V^z 70SK U Z> £ 
O\ZlJQx.-&0X;-e\Qftft J $>Lfc. 41C 15,000 rpa-C, 10 

ftmm>bftM'&* ttEi&m^mvmmAazmmvT rn 

Alit^iib^:. tORNA^^^Ilfc^f, 2 mgO 
iRNA^en^;. 
[0 0 3 2 ] <2>mRNA(Di^ 

30 ±!2tf ^nfciRNA^6mRNA£#U (A)' RN 
Ai^tlt^fiLfc. m$SJC«, ^U(A)' RNAlt 
ISlffl^Uzf (dT) S^{b^^^^XX^^>0ligotex-dT3 
0<Super> (Mftffi ckO^A) €rfflV^, iRNA 

1 agOOigftttCEIot ion buffer (mttiny 7 7 — : lOmM T 
ris-HCl (pH7. 5). laM EDTA, 0. IX SDS) £ Jbn A T fi 7? 
laltb. -etltCOl igotei-dT30<Super> 1 al ^JSjDtL, 65 
tt 5 ^r«1Jtol»b, Tk-hX 3 ^rP^^L^bfc cntCSM 
NaCI 0. 2nl€:j(JDA> 37T 1 0^ M -f > * ^ ^— h b tz '&\ 
5, 000 rpmX, 3^P^^^^r8i^> ±ffi < 

40 b^c. 

[0 0 3 3 ] ^1/7 h$r Washing Buffer 
— : lOnM Tris-HCI (pH7. 5), laM EDTA, O.SMNaCl, 0.1 
X SDS) 2.5al(C®^b> 1 5, 000 rpaT. 35^Pb1^^^ 
Bt&, ±fH^a^^< ^^bAc. ^l/y h^rTE Buffer 

laifcfflWbT. HE-<o^65 , cr5^Poiija^bfc. :n$ 

*±T3^F^Lrc©^, 15,000 rpa-C 3 ftmi&>bft 
mV, ±ffif^C07^ U (A)' mRN A^lnlJRb^:. ±gBO 
<fc^i:lT, lig<D±RNA^e>«J10/xg^)^U (A) f m 
RNA^menfc. ^(Do^05 ug$c DNA7^^ 
50 ^ 'J-Of^HlC^fflbfc. 
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[0 0 3 4] <3>c DNA7^f^7 'J — <Df£§3 
( 1 ) c DNA(D^j^ 

lE-CH^nfemRNA^ICfflu, St rat agenelfcM 

5.0m I 10 x 1st Strand Buffer <JBIE9RjSJfl,t „ 7 
7-) 

3.0*1 lOoM 1st Strand Methyl Nucleotide Mix (5-a 
etnyl dCTP , dATP, dGTP, dTTP M&m 

2.0*1 Linker-Priner ( U -* i 0 
H t 0 (±#£50* otCSBSffr^. ) 

1.0*1 RNase Block II (RNAseffl^^J) 

[0 0 3 5] ±zZ<D&&ftte* y h(Df*9^TfcD> Li 
nker-Priner<Z>£3Rjf*. mm&m 3 Uj^c 0 T 

^IS^I^cDNA^i^ri^m^ct^ 

±efi***«fc<«#l,&«. *U(A)' 
mRNA 5.0*g£floA, liSTlOMlLfc. £ *> tc 
2.5*1 M-MuLV RTase (igfe^gj^) 

£#»<50*Hc**) , «^fcH^L)S:« % 2 o 
LTS^^^^zr^^oig^Jc^i 37<CT60#££; 

[0 0 3 6] A\Z, UT<Dm%i*m?F\ZVt^T?-*.--7 

45.0* 1 c DNA-^i^ttf^« 
40.0*1 10X2nd Strand Buffer »J * ^ -ifJxfc 

6-0*1 2nd Strand Nucleotide Mixture (A G C 

302.0*1 H,0 3Q 
[ 0 0 3 7 ] I«T©^^lw lfc ^ RNasetDNA 

^Kfcttfc^a-r/OJKttfCfci L % 16*CT1505K c 
DNAI^©a^^^ C( 

0.8*1 RNase H (RNA^fP^^) 

7.5*1 DNA/tf U ^ ^ — tf I (10.0 n /* 1) 
[0 0 3 8 ] iiniC400*i<D^xy-^; ^DD*K 
(l:D Sttein*. <*<lfc#Lfc!£lsli&T2#&,fcL 

fe. ±WfC?fS400 * |C07 X /-^ . ^D D ^7UA^Jta 40 

TcD^^^r^AT c DNA £fctJB 

[0 0 3 9] 
33.3*1 3M ^tMJ^AW 
867.0* 1 I 00% X* 

[0 0 4 0] .n^OtT^ttSl, ^ii-C60^S 
^*K»#L/i. ^<Z>5*>39.0* I tZ&>T <D®m&mto 



10-276782 

10 

[ 0 0 4 1 ] 

5.0*1 I0XT4 DNA Polynerase Buffer <T4tf U ;* 

2.5*1 2.5«M dNTP Mix (A,G,C,T*g£^) 
3.5*1 T4 DHA#U*^—e (2 . 9 D / u , } 
[ 0 0 4 2] 37«CT30#/*tf$^ 50 * I O&if* * Jta 
^«l00( t IO7xy^ : ^DD*^Aft ll ,^ i 
T7^7lT2^Mlfe. -tftiClOO* I CD 47 D D*;U 
A £ Jul*. , ^f7^XlT2^L, i»l:£(T0 

7.0* I 3M h »Jr)A^^ 

226 * I 100% x^y~;|, 
[0 0 4 3] £CD&&£*± ( c307>^±4*®L, 4tT 

607>^^Ufc. tti££i5o* io80%x^ y-ju-e^j^ 

h£7.0* IcDEcoRI Adaptor (EcoR! 7 — ) Jgffcfc; 

fcEcoRIT^^-fclllgLfc. EcoRIT$^-cD£* 
1.0*1 lOXLigation Buffer ( 'J — t?£j£Jl!/t y 

7r-) 

10*1 lOnM ATP 
10* I T4 DNA U # — if 
[0 0 4 4] ^^I$g<^ l4t{: _ M±|t 

sft^MiznmL. siT<D®m&)m7Lx, ecorit^^ 

1.0*1 lOXLigation Buffer ( U # — 

7 7 — ) 

2. 0* I lOnM ATP 

6.0*1 H,0 

1.0*1 U*»X9l,**}**j—if (10. 0 o /* 
I) 

[0 0 4 5] STtTSO^K**-*, 70 t: T30#&3! L ft 

**0*, 37 ^^90^S^^^TXhoITLinker-Pri Be rJc 

28.0* 1 Xbol Buffer 
3-0*1 Xbol (45 u /* |) 

[0 0 4 6 ] C©E**fc5.0/* IO10XSTE (lOnM Tris 
-HCMpHS.O). IOOdM NaCI, lnM EDTA) ZM*. ~> 3 - 
^7^>hiiffli|^^ (Sepbacryl Spin Col 
oan) fc*mu 600 gT2^L^:^(ili^, 75? 
>s>ltLfz. mizzcomft* 3®«0igL, *tl 
*tl77*i, 3> 2. 3X^4iUc. 75*is3> 
3. 4££fctt v 7iy^ : ^DD*U(l:l)Slo 
*<fc<»#U £*&T2#&*LAc. ±falC^*CD*D 

<z>±smc 2 tecoioo%x* /-**jD*-2ot:fc-*a« 
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JUTftfrb/c. £ ^«C4 < C"C60^a'6U. i^nfccD 
NACD^U- y h£lOtf lCDfc£M*K®*g b*:. 

[0 0 4 7] (2) cDNA7<^7'J-Of^S 
J:^Tf§£tl£:-*$ac DN A£ A 7 7 — S^SS'*** 

-tca^t, 2a&*^*-£Pt$ab*:«, ^n.-^tc^ 

[0 0 4 8] 

2.5a I c DNAmm 10 
0. 5 xx I 10 x Ligat ion Buffer 
0.5m I lOnM ATP 

I.Oiil AZAP^^^-DNA (l#£g/jcl) 
0.5itl T4 DNAU# — if (4 Weiss u//xl) 
[0 0 4 9] (3) 77-yDNA0)77-yiaf^a) 

c DN A^lf ALfc7 7 — y^^^-^, Ot'hn- 
;^7^-y>^^7 h (Gigapackll Gold packaging ex 
tract ; St rat agene*t§g) &m^T7 T — iS$L^\Zrty 

^-zsyyvrc. mmi,rzm.m<D&Li&-Mmj:*7 K <Fr 20 

eeze / Thaw extract) lc*&&x_ 7 7 — z/J§i&£An A. 
5k±fCg#. ' -T <TtCI5/x iCDjg^ifc&Sx*;* (Sonic e 
xtract) ZMXs e-^ySr-f >^UXcfc<ig^L&. d 
*l£$l<3g>fr bT^i& (22*0 JC2 WPQikBLfe. ±12 
Sl£*&J;:500/x 1 CD 7 7 — i^ft/X s> 7 7"- (Phage Di I u 
tion Buffer) <£lJDA£*3lC20ii ico^ D o^JUA^rlni 
A, jg^bfco ^-T y 7 'J — CD^< ^ — £$l£"r<5fca6 
JC, 500 xz 1 cDTfcffiCDft 2 xx 1 1 8 zx 1 CO SM buffer ( 1 L 
NaCl 5.8g, MgS04-7H2 0 2g, 1M Tr i s-HC I (pH7. 5) 
50nl, 2% if^^> 5ml) T 1 : 1 0IC L „ lxzlco# 30 

*B§^XLl-BlueigH&<0D,„=0. 1) 

'SJWt— 7 r — >?R408 
[0 0 5 2] ±fe*g£#Uc 5bICD 2 X YTiSifc Srjta* , 37 

00 g-C5#&it>b*:. ±flt£5 f ;*7 > h b, i&^oL-y 

^bfc. :n^Ml, ±r1t£ioofg#iKb*:&&20 

U I £0D,„ #1. £ b*:;*:Jl§lgXLl-Bloe CO 

*£^i£200 /x 37t:T157>^&2-t*-/t. 1-100 

/x icOJ&^fSS:, 7 >tf ->'J >£^mB7W — MC 7 U 40 

— — £ ^ >^AKStt bT. 7U"fe D-;U£jto;LT-80 

[0053] (6) 7^x ^ wmm 

7^X^ F00IHR,, ProaegatfcUcOMagic ni n i -prep* 
^/h^^^TfT^^:. -80t:-C«#LT^V^c^^X^ 
F?:Mt^^iI(0^}$=£, 5nlcD2 XYTfcifclCjg 
IL, 37*C-C — WiLfc. 5»mSi& (4, 000 rpi, 
4*C) Lt±a^f* > h \Z £ 0l$3cb, ®^^Utyh 
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1 a ICD7 T-^Mf^^r^tl-^n, OD, (i ^0.5fC 
&££Ti£^b*:;*:Jl§^ PLK-F***CO*S#i£200 xx 1 1& 

\zzn>-rx >#Lti. ~ttzt>*>. plk-f'^^ 

7 7 -iSm&ZMSls, 37t:-C15^^||b, :tl^ 2 - 
3 ml h y 77#- (481C) ICJHJ*. it £ 37T; IC 
fcNZY 7^-7*1/- h'vMJIb*:. STrT-ffciS^b, 

miiLfc^^-^ ^scAT^-f ?-&wthLrz. torn 

Jfc, *<i * — ttl. 2X 10* pfu/BlT&^fc. 
[0 0 5 0 ) (4) ^-f 7^ ij — (Oil*© 

anb^a. — 7 tins so, oooco*a^&x.A^^u *7 r — y 

£^tf Ay — i?>#mmt . 0D6 0 0 rt*0. 5£&* £ 
■C»*Lfc*IB« PLK-F'ftcoig^eOOzx l£jtax., 37 

*5^*:6.5bIco h y^T^-^iflA, fcmrfct&J&T** 
fr>fel50u NZT 71/— McS/gb, 37t: T? 5 — 8 P#fyji& 
#7U— MwlOnKDSM Bn f f er & JjQ X , 4tt 

COSM Boffer^M^b^i^U 7n fc! U>^^~7(c^ 
W.lZ&y Is — h £ 2 ni 1 CO SM bof ferT U >X b, d 

^A&lDATS^b, ^ia-C15^&©b, 4, 000 gT5 

ftm>bisTmfc&m^it. z<D±m\z. ^m(dq.z%\z$, 

fslfe7-f 7*7 U -co^-T ^ — & ml 12 £. FimiZ ITI^ 
b^ci^^, 2.3X10* pfu/mlT^-p^. 
[0 0 5 1 ] (5) 77-yDNA^bOT'7X5 HCO 
^Dttlllix.77-yDNA5(i^, 7^X^FSBt>co 
-f >b*#^DOfe*b (In vivo Excision) ^fTofc. 50ml 
cozi -7J^^^L-7^tC^TcOt>co^?m^b. 37*CT15 

200 iz I 

200xzl O1X10 1 

I /x I (>1 X 10 1 pfu/nl) 
^<£:X y 'O H;U7^zL-7tc^b, 55rPfl3S^ (5,00 
0 rpm, 41C) ITiB^T^ > htCJ: DR&^bfc. 

[ 0 0 5 4 ] h(C300xx ICOCell Resuspensio 

d Solution (j®^.©^ffl^^) &tmx J: < fifll, Xy 

^> r 5 ;i/7^3.-7ic»b^. cn^ 2 ^WS^If-T? 

M^b, 300 I CDCel I Lysis Solution (mi&®mmm 
&) ^JtoAS^IC^c^ £-emWVti. $ ^IC300 m icoNe 
utralization Solution (^fD^^^) *Ju^.. ^Tti 

■3ta*Lfc«, loamafe (15,000 rpn) b^. 

[ 0 0 5 5 ] ±m<D&*%iL^3Ly ^> H^7^i-7 
(1.5BI) iC^bfc, iR3l*fc3y^, ^ -TJ^A, > 
»J>>? (ftltS) <DKi-C&*ab. > U >ylCl/y >S: 1 
BlAnfc. ±IB±«5:uC0yU>yC0^^aAia<«[ 
^b^:^. K^lb^:. ColuBn lashSolulion {*) ^ AVl 

&mmm ^ 2Bijtox.©§ibTtt^b, &&<Djt&i-~ 

^l^X 7 ^>h , ;l,7^a-7' CI. 5a I > rtfctyH 
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a) LAc. &ttii&£x,y^> K;i,7?a-yi:^L, 3M 

OmIIBJ*.*:. cn$:®^L (15000 rpi. 25#) , ±^ 
*&T* tt®lc70Xx*y-ji,£ i ■lUo^. ^tjc^^L 
^ (1 5000 rpa, 3#) „ x $ / — ;p \Z W 0 Bfc 

Ac. fctj»£20M tcD$&*K&<i§ri>L, -2ot;-e&#L 
Ac. :0^l/i|^ oT , # U 10 

[ 0 0 5 6] <4>c DNAcD^^K^JO^^RcXitfe 
(1) c DN AOlgSSB^JO^^ 

cDNAcolfi^jMli, T^-fH/HtyXfA 
Xtfc (ABi) iaDNAt-h^^r>f~373A^l i T 
frofc. ^>-^>XS^^^|t®lDye Primer Cycle Seq 
uencing* y M;J:0T3y7<7-^i iT ^ oi 

[ 0 0 5 7] (2) *^Py-tf 

e ^"-^w-^^ ^ ^^^^^ 3io^a->^/^ 
[0058] <5>^±m? u-><DmM 

( 1 ) 5* RACE 

^<T-£^f&U mRNA£«r^KRT-PCR£TTfc:o>c. 30 
[ 0 0 5 9] (l-a) first-strand DNAcd^jsS 
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0.5mI IOmioI gene-specific priaerl 
IMS total RNA 

DEPC-lreated H, 0 (£#£9mI tc ft: * £ 5 Ic 
£ ) 

[ 0 0 6 0] gene-specific priaeri^LT, PcsAlffitc 

^K^, PcsA2fflfc{igB^j#^i 6fB%o^^ffi^^^ 
^"^^UrfX^l/^-^H^, PcsA3ffl«C(ige^J#^ 1 7 40 

[0 0 6 1 ] ±f2£j£&£|g^riWcjg£L£&* &<j£ 
'fcLTSfc&fe^-yo&^fc^ ^ 70t:-C10^^ 

5XRT Buffer 5 m I 

25aM MgCI, 2. 5m I 

2aM dNTP lii 5m 1 

0. 1M DTT 2. 5 m I 

H, ° ^&wniL\\ztzz>&o\zmx%) 50 



*^*-7<o&&izmt 42 < Ctm#@h£. -ecuci 

//I SuperScriptl! RT (igfc^&^S GIBCO BRL) £Jnj 
A, 42t;T50#SfS£l*7c. ^(7) 

&70r-ri5tt©#, RIB******. 8<»4L T R 

-tCIxzl RNase H (3fc#&) ^jto*, 37*C T30#£ j£ $ 

[0 0 6 3] »^T, £&i£*K^£*i<5. iS^COy^ 
-f v-^*^ k*-^F*»< AcOfrlC. ^- ij >#~tt$g 
<£*^§2*^A, Quick Spin Coluans & JB tiT**^* j§ 
£fr&o£:. ;*7^Atf>tip I100XgT2# 

LTbufferSr&Tfc. 2/ ^ A (D 4"^ (c ±g21*Jft& 
£An, H00XgT4^F45i^L, ®Wi&l*mLtz. 
[0 0 6 4] (1-b) Poly(dC) tailing 

5 m I 5XCoCl t Buffer 
2.5^ I 2nM dCTP 

H:0 (±it&24ii\ IZteZ&^tzmm'?*) 

[0 0 6 5] ±i2£tlS8S£&<^L, 94«CT3#f8]© 

^fe. ^<^^i,x^zr — y<ommzm^vx. *±iz 

?£&)£iMt to^x^<^^u, 37*C-??10^rJlJKJf£:$-fr 

[0 0 6 6] (l-c) PCR^ 
iilBR^fft^^, 2.5mI Et9, J^TcQJg&^jra*.^ 
2. 5m 1 10XPCR Buffer 
2. 5m I 2aM dNTP aix 
0.5m1 Gene-specific primer2 
0.5m I Abridged Anchor Primer (GIBCO BRL) 

0.5m I Advantage Klentaq Polyaerase Mix (Clontec 
n) 

HtO (±ft*25u\\zUZ>J:o\zmB-rz>) 
[0 0 6 7] Gene-specific pr iaer2 £ V T«, PcsAl/8 

iz\tmmm^i smm<oiiL3tmn*if+ u n * * u 
^-^K^r, PcsA2ffijc«s^j#^ 1 9 mm.<K>m.mmm* 

#-rz>*V 3%2 PcsA3^Jc«ie^J#^2 
Ac. 

[ 0 0 6 8] 0.2al^^-^|cAn, ^T<D^#TPCRJK 
J^^frActo^c. 

PAD 9413 90#> 

94t: 30g> 
60-68^ 30—60g> 
681C 180© 
Final 68t: 7^ 

Hold 4tJ 
C0 0 6 9] R**or^D-xy^*«Hc»*f3rtt 
^. St)^t^^X (PcsAIT^1.8K,PcsA2T*?J2K,Pc 
sA3-C<^2.2K) OgP^ODNA^y;U^e«aiLfc. Affile 
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\t CLONTECH IfcCDGENO-BIND ^<D?Uhn-)[, 
Kftofc. C o LT^^n/cDNAfC Poly (dC> tai I ing £ 

[0 0 7 0] (2) *D-->y 
(2-a) 5* RACE©TA^O-z>^ 
^^nfcPCRS^f^^^. InvilrogenttO TA Cloning K 

it £JfH>T. *<D7u hn — )lizVt^. t? u — -yy 

SrfTft^fco ±fBT|§6n*:PCR£&&*l. 5a I tCE*T 

0. 5a I lOXLigat ion Buffer 

la 1 pCRII vector 

0. 5 /z 1 T4 DNA Ligase 

1.5a 1 dHrO 

[0 0 7 1 ] UTCT — foKf&Zittz. f(DK^^2 

ai£*B§^:3>fc:^> hir;t/ (JHi09) 25a ncJta*.. 30 

ftm*±iz*>\,*fcm* 42t:T3o^rBit- hv 3 v 

*-bT2*rPi>muiL*:!£, soB^%^450a iira*.T 

200rpm37T:TII$^iI<>: Ctl^, Arap/Xgal/ 

IPTG yi/-h(C$f , STr-C-^SIM^IIUfc. 20 

[0 0 7 2] (2-b) cDNA CD^n — 

ABI &$3<E> DNA Sequencer 377 ^ffl^iyoh^-^ 
lC$£oTfTft o fc„ f>-^r>XS^«I^tt© Dye Terni 
nater Cycle Sequencing 4 1 hKJcD Ml 3 ^7^7- 

— ^r>Xor>j^^PcsA3JCg|LTH> Ri;PcsA3(Oit 
3, 4#M) , PcsA3CD 3 ^^^^r^tf^n — > (Pes 

A3-682) <D&&mw&ztz<o&mmmfr£>^m2nz>T 

S^J#^5, 6IC^-T. PcsA3<D5* $1 
8i*&£:^tr#Jtf>^ D-><D 5 ' (PcsA3-5* ) £>&g 
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m &i r c: <d & s ffi * ■£> ^ s £ n * r ^ / m m m s , 

(PcsA3-3') O&gffi^JRtf C<D&ggE3*J^^S2n 

57^yis^j$, s^s§9, i o izmT. pcsai.pc 

sA2i'KLT«. nZtLfzmnJi O0RF£^tf^g^£<D5 , 
jfettiS^tfS' ^a&O:/^ "T-Sr^ffcL. PCR 

■7 — te, PcsAlffllC«ffi^i#^ 2 1 (5 # 5fc38) RtfgE^J 
M2 2 (3* 5fc$S) IC^f^U r/^^ V>*^ Pes 
A2ffltC«SB^J#^2 3 (5' mffi) R^S^jf^2 4 

[0 0 7 3] 

[0 0 7 4] 

: 1 

ffi^JCD^^ : 3207 

le^l^li : cDNA to idRNA 

: *7 $ (Gossypiun hirsutun L.) 
: Coker312 

^^$r^-THB# : CDS 
ntt&m : 77. . 3001 



GGTTAGCATA TTGTTTGTAG CATTGGGTTT TTTTCTCAAG GAAGAAGAAG GAGAAAGATA 60 

AGTAATGTTT ttgaga atg atg gaa tct ggg gtt CCT GTT TGC CAC ACT 109 

Met Met Glu Ser Gly Vat Pro Val Cys His Tnr 

1 5 10 

TGT GGT GAA CAT GTT GGG TTG AAT GTT AAT GGT GAA CCC TTT GTG GCT 157 

Cys Gly Glu His Val Gly Leu Asn Val Asn Gly Glu Pro Ptae Val Ala 

15 20 25 

TGC CAT GAA TGT AAT TTC CCT ATT TGT AAG ACT TGT TTT GAG TAT GAT 205 

Cys His Glu Cys Asn Phe Pro lie Cys Lys Ser Cys Pne Glu Tyr Asp 

30 35 40 

CTT AAG GAA GGA CAA AAA GCT TGC TTG CGT TGT GGT ATT CCG TAT GAT 253 

Leu Lys Glu Gly Gin Lys Ala Cys Leu Arg Cys Gly He Pro Tyr Asp 

45 50 55 

GAA AAC CTG TTG GAC GAT GTC GAG AAG GCC ACC GGC GAT CAA TCG ACA 301 

Glu Asn Leu Leu Asp Asp Val Glu Lys Ala Thr Gly Asp Gin Ser Thr 
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AIG GCT GCA CAT TTG AGC AAG TCT CAG. GAT GTT CGA ATT CAT GCA AGA 349 
Mel Ala Ala His Lea Ser Lys Ser Gin Asp Val Gly lie His Ala Arg 

80 85 9() 

CAT ATC AGC ACT GTG TCT ACA TTG GAT ACT CAA ATG ACT GAA GAC AAT 397 
His He Ser Ser Val Ser Thr Leu Asp Ser Glu Mel Tbr Gin Asp Asn 

95 100 105 

GGG AAT CCG ATT TGG AAG AAC AGC GTC GAA AGT TGG AAA GAA AAG AAG 445 
Gly Asn Pro lie Trp Lys Asn Arg Val Gin Ser Trp Lys Gin Lys Lys 

UO us 120 

AAC AAG AAC AAG AAG CCT GCA ACA ACT AAG GTT GAA AGA GAG GCT GAA 493 
Asn Lys Lys Lys lys Pro Ala Thr Thr Lys Val Gin Arg Gin Ala Gin 
125 130 ,35 

ATC CCA CCT GAG CAA CAA ATG GAA GAT AAA CCG GCA CCG GAT GCT TCC 541 

He Pro Pro Gin Gin Gin Mel Gin Asp Lys Pro Ala Pro Asp Ala Ser 

140 l« 150 ,55 

CAG CCC CTC KG ACT ATA ATT CCA ATC CCG AAA AGC AGA CTT GCA CCA 589 

Gin Pro Len Ser Thr He lie Pro He Pro Lys Ser Arg Len Ala Pro 

•00 165 no 

TAC CGA ACC GTG ATC ATT ATG CGA TTG ATC ATT CTC GGT CTT TTC TTC 637 
Tyr Arg Thr Val lie He Mel Arg Len He lie Leo Gly Len Phe Phe 

175 180 |85 

CAT TAT CGA GTA ACA AAC CCC GTT GAC AGT GCT TTT GGA CTG TGG CTC 685 
His Tyr Arg Val Thr Asn Pro Val Asp Ser Ala Phe Gly Leu Trp Len 

190 195 2 oo 

ACT TCA GTC ATA TGT GAA ATC TGG TTT GCT TTT TCC TGG GTG TTG GAT 733 
Thr Ser Val He Cys Cln He Trp Phe Ala Phe Ser Trp Val Len Asp 

205 2 |0 215 

CAG TTC CCT AAG TGG TAT CCT GTT AAC AGG GAA ACA TAC ATT GAC AGA 781 
Gin Phe Pro Lys Trp Tyr Pro Val Asn Arg Gin Thr Tyr lie Asp Arg 
220 »5 230 235 

CTG TCT GCA AGA TAT GAA AGA GAA GGT GAA CCT AAT GAA CTT GCT GCA 829 
len Ser Ala Arg Tyr Gin Arg Gin Gly Gin Pro Asn Gin Leo Ala Ala 

2 <° 245 250 

GTT GAC TTC TTT GTG AGT ACA GTG GAT CCA TTG AAA GAG CCT CCA TTG 877 
Val Asp Phe Phe Val Ser Thr Val Asp Pro len Lys Glu Pro Pro Len 

255 260 265 

ATT ACT GCC AAT ACT GTG CTT TCC ATC CTT GCC TTG GAC TAC CCG GTA 925 
He Thr Ala Asn Thr Val Len Ser He Len Ala Len Asp Tyr Pro Val 

270 275 280 

GAT AAG GTC TCT TCT TAT ATA TCT GAT GAT GGT GCG GCC ATG CTG ACA 973 
Asp Lys Val Ser Cys Tyr He Ser Asp Asp Gly Ala Ala Met Len Thr 

285 290 295 

TTT GAA TCT CTA GTA GAA ACA GCC GAC TTT GCA AGA AAG TGG GTT CCA 1021 
Phe Glu Ser Len Val Glu Thr Ala Asp Phe Ala Arg Lys Trp Val Pro 
300 305 310 315 

TTC TGC AAA AAA TTT TCC ATT CAA CCA CGG GCA CCT GAG TTT TAC TTC 1069 
Phe Cys Lys Lys Phe Ser He Gin Pro Arg Ala Pro Glo Phe Tyr Phe 

320 325 330 

TCA CAG AAG ATT GAT TAC TTG AAA CAT AAA GTG CAG CCC TCT TTT GTA I 117 
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Ser Gin Lys 

AAA GAA CGT 
Lys GId Arg 
350 

ATC AAT GCT 
I I e Asd Ala 
365 

ACA ATG CAA 
Tbr Met Gin 
380 

CCT GGC ATG 
Pro Gly Met 

GAA GGA AAT 
Gin Gly Asn 

CCT GGC TAC 
Pro Gly Tyr 
430 

AGG GTG TCT 
Arg Val Ser 
445 

TGT GAC CAC 
Cys Asp His 
460 

TTC TTG ATG 
Phe Leo Met 

CCT CAA AGA 
Pro Gin Arg 

AAC ACA GTT 
Asn Tbr Val 
510 

GGG CCT GTT 
Gly Pro Val 
525 

TAT GGC TAT 
Tyr Gly Tyr 
540 

TCA TCT TGC 
Ser Ser Cys 

TCA GAG CTT 
Ser Gin Len 

TTT AAC CTT 
Pbe Asn Len 
590 



1 1 e Asp Tyr 

335 

AGA GCT ATG 
Arg Ala Met 

TTA GTT GCA 
Len Val Ala 

GAT GGA ACT 
Asp Gly Thr 

385 

ATT CAG GTT 
lie Gin Val 
400 

GAA CTT CCC 
Gin Len Pro 
415 

CAA CAC CAC 
Gin His His 

GCA GTT CTT 
Ala Val Leu 

TAT GTT AAC 
Tyr Val Asn 
465 

GAC CCA CAA 
Asp Pro Gin 
480 

TTT GAT GGC 
Phe Asp Gly 
495 

TTC TTT GAT 
Phe Phe Asp 

TAT GTG GGA 
Tyr Val Gly 

GGT CCA CCT 

Gly Pro Pro 

- 545 

TCG TGT TGC 

Ser Cys Cys 
560 

TAT, AGG GAT 

Tyr Arg Asp 
575 

AGG GAA ATT 

Arg Gin lie 



Leo Lys 

AAA AGA 
Lys Arg 
355 
AAG GCT 
Lys Ala 
370 

CCT TGG 
Pro Trp 

TTC CTT 
Phe Len 

CGA CTG 
Arg Len 

AAA AAG 
Lys Lys 
435 
ACA AAT 
Thr Asn 
450 

AAT AGC 
Asn Ser 

GTC GGT 
Val Gly 

ATA GAT 
I le Asp 

GTT AAC 
Val Asn 
515 
ACA GGT 
Thr Gly 
530 

TCA ATG 
Ser Mel 



( II ) 

Asp Lys 
340, 

GAT TAC 

Asp Tyr 

CAG AAA 

Gin Lys 

CCG GGA 

Pro Gly 

GGA TAT 

Gly Tyr 
405 

GTT TAC 

Val Tyr 
420 

GCT GGT 

Ala Gly 

GCT CCC 

Ala Pro 

AAG GCA 

Lys Ala 

CGA GAT 

Arg Asp 
485 

AGG AGT 

Arg Ser 
500 

ATG AAA 

Met Lys 

TGT GTT 

Cys Val 

CCA AGT 

Pro Ser 
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Val Gin Pro Ser Phe Val 

345 

GAA GAG TAC AAA ATT CGA 

Gin Gin Tyr Lys He Arg 
360 

GAA GAA GGA TGG 

Gin Glu Gly Trp 



TGC CCC GGC AAG 
Cys Pro Gly Lys 

565 

GCA AAA CGG GAA 
Ala Lys Arg Gin 
580 

GAC AAT TAT GAT 
Asp Asn Tyr Asp 

595 



ACA CCT 
Thr Pro 
375 
AAT AAC 
Asn Asn 
390 

AGC GGT 
Ser Gly 

GTC TCT 
Val Ser 

GCT GAA 
Ala Gin 

TTC ATC 
Pbe lie 
455 
GTT AGG 
Val Arg 
4 70 

GTC TGC 
Val Cys 

GAT CGA 
Asp Arg 

GGT CTT 
Gly Len 

TTC AAT 
Pbe Asn 
535 
TTT CCC 
Phe Pro 
550 

AAG GAA 
Lys Glu 

GAA CTT 
Gin Len 

GAG TAT 
Gin Tyr 



CCG CGT 
Pro Arg 

GCT CAT 
Ala His 

AGA GAG 
Arg Gin 
425 
AAT GCT 
Asn Ala 
440 

CTC AAT 
Leu Asn 

GAG GCA 
Glu Ala 

TAT GTG 
Tyr Val 

TAT GCC 
Tyr Ala 
505 
GAT GGA 
Asp Gly 
520 

AGG CAA 
Arg Gin 

AAG TCA 
Lys Ser 

CCT AAA 
Pro Lys 

GAT GCT 
Asp Ala 
585 
GAA AGA 
Gin Arg 
600 



GAT CAC 
Asp His 
395 
GAC ATC 
Asp lie 
410 

AAG AGA 
Lys Arg 

TTG GTT 
Leu Val 

CTT GAT 
Len Asp 

ATG TGC 

Met Cys 
475 

CAG TTT 

Gin Phe 
490 

AAT CGG 

Asn Arg 

ATC CAA 
He Gin 

GCA CTT 
Ala Len 

TCC TCC 

Ser Ser 
555 

GAT CCA 

Asp Pro 
570 

GCC ATC 

Ala He 

TCA ATG 
Ser Met 



1165 



1213 



1261 



1309 



1357 



1405 



1453 



1501 



1549 



1597 



1645 



1693 



1741 



1789 



1837 



1885 



( 12 ) 
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TTG ATC TCT CAA 
Lea lie Ser Gin 
605 

TTC ATT GAA TCT 
Phe He Glo Ser 
620 

AAC CCT TCC ACA 
Asn Pro Ser Thr 
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22 



TAT GAG 
Tyr Glo 

TCA GTC 
Ser Val 

TGG AGA 
Trp Arg 
685 
GCA CCC 
Ala Pro 
700 

CTT GGA 
Lea Gly 



GAG AAG 
G!u Lys 
655 
ACT GAG 
Thr Gin 
670 

TCG ATT 
Ser He 

ATC AAT 
1 1 e Asn 

TCT GTT 
Ser Val 



ACA AGC 
Thr Ser 

ACA CTA 
Thr Lea 
625 
CTA ATC 
Leu He 
640 

ACT GCA 
Thr Ala 

GAT ATC 
Asp He 

TAC TGC 
Tyr Cys 



GGC TTT 
Gly Phe 

AAC ACC 
Asn Thr 

TCA CTA 
Ser Len 
765 
CTC TCA 
Leo Ser 
780 

ATC GTG 
lie Va! 



GGA GGT 
Gly Gly 
735 
ATT GTC 
He Val 
750 

CCA GCA 
Pro Ala 

AAC CTG 
Asn Len 

ACT GCT 
Thr Ala 



CTG TCT 
Leu Ser 
705 
GAA ATT 
Glu He 
720 

GGT CGT 
Gly Arg 



TTT GAG AAA 
Phe Glu Lys 
610 

ATG GAG AAT 
Met Glu Asn 

AAG GAA GCA 
Lys Glu Ala 

TGG GGG AAA 
Trp Gly Lys 
660 

TTA ACC GGC 
Lea Thr Gly 

675 
ATG CCC TTA 
Met Pro Leu 
690 

GAT CGG TTG 
Asp Arg Leu 



ACT TTT 
Thr Phe 

GGA GGA 
Gly Gly 
630 
ATT CAT 
He His 
645 

GAG ATT 
Glu He 

TTC AAA 
Phe Lys 

AGG CCA 
Arg Pro 



TTC CTA AGC 
Phe Leu Ser 



TAT CCT 
Tyr Pro 

ATC TGT 
He Cys 



GAC TTA 
Asp Leu 

CAT CTC 
His Leu 

GAC ACC 
Asp Thr 
845 
GGT GAG 
Gly Glu 



TGG CGT 
Trp Arg 
815 
TTT GCC 
Phe Ala 
830 

AAC TTT 
Asn Phe 

CTC TAC 
Len Tyr 



GCA AGT 
Ala Ser 
785 
GTT CTC 
Val Leu 
800 

AAC GAG 
Asn Glu 

GTC TTC 
Val Phe 

ACT GTC 
Thr Val 

ATT GTG 
He Val 



CTT AAA TGG 
Leu Lys Trp 
740 

TTC ACA TCC 
Phe Thr Ser 
755 

CTT CTC ACA 
Len Leu Thr 
770 

GTT CTC TTT 
Val Leu Phe 

GAG CTC CGA 
Glu Leu Arg 

CAG TTT TGG 
Gin Phe Trp 
820 

CAA GGT TTC 
Gin Gly Phe 
835 

ACT GCC AAA 
Thr Ala Lys 
850 

AAA TGG ACT 
Lys Trp Thr 



CAC CAG 
His Gin 
710 
AGG CAT 
Arg His 
725 

CTT CAA 
Leu Gin 

CTT CCA 
Leu Pro 

GGA AAA 
Gly Lys 

CTT GGC 
Leu Gly 
790 
TGG AGT 
Trp Ser 
805 

GTC ATC 
Val He 



GGC TTA 
Gly Leu 
615 

GTG GCT 
Val Ala 

GTC ATC 
Val He 

GGA TGG 
Gly Trp 

ATG CAC 
Mel His 
680 
GCA TTC 
Ala Phe 
695 

GTT CTT 
Val Leu 



TCT TCA GTC 
Ser Ser Val 

GAA TCT GCC 
Glu Ser Ala 
635 

GGC TGT GGC 
Gly Cys Gly 

650 
ATA TAT GGT 
He Tyr Gly 
665 

TGC CGA GGA 
Cys Arg Gly 

AAA GGA TCT 
Lys Gly Ser 



TGC CCT 
Cys Pro 



AGA CTA 
Arg Leu 



CTC ATT 
Leu He 
760 
TTT ATC 
Phe He 
775 

CTT TTC 
Leu Phe 



GGT GTC 
Gly Val 



GGT GGC 
Gly Gly 



CTT AAG 
Leu Lys 

GCA GCT 
Ala Ala 

ACA CTT 
Thr Leu 



ATG CTT 
Met Leu 
840 
GAT GAT 
Asp Asp 
855 

CTA ATC 
Leu He 



CGA TGG GCT 
Arg Trp Ala 
715 

CTA TGG TAT 
Leu Trp Tyr 

730 
GCA TAT ATA 
Ala Tyr He 
745 

GCC TAT TGT 
Ala Tyr Cys 

ATA CCA ACG 
He Pro Thr 

CTT TCC ATT 
Leu Ser He 
795 

AGC ATT GAG 
Ser He Glo 
810 

GTT TCA GCC 
Val Ser Ala 
825 

GCG GGC ATT 
Ala Gly He 



GCA GAT TTT 
Ala Asp Phe 

CCT CCA ACA 
P.ro Pro Thr 



1933 



1981 



2029 



2077 



2125 



2173 



2221 



2269 



2317 



2365 



2413 



2461 



2509 



2557 



2605 



2653 



2701 
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23 24 
860 865 870 875 

ACA CTC CTC ATC GTC AAC ATG GTT GGT -GTC GTT GCC GGA TTC TCC GAT 2749 
Thr Lea Leo He Val Asn Mel Val Gly Val Val Ala Gly Phe Ser Asp 

880 885 890 

GCC CTC AAC AAA GGG TAC GAA GCT TGG GGA CCA CTC TTT GGC AAA GTG 2797 
Ala Leo Asn Lys Gly Tyr Glu Ala Trp Gly Pro Leu Phe Gly Lys Val 

895 900 905 

TTC TTT TCC TTC TGG GTC ATC CTC CAT CTT TAT CCA TTC CTC AAA GGT 2845 
Phe Phe Ser Phe Trp Val He Lea His Leu Tyr Pro Phe Leu Lys Gly 

910 915 920 

CTT ATG GGA CGC CAA AAC AGG ACA CCA ACC ATT GTT GTC CTT TGG TCA 2893 
Lea Mel Gly Arg Gin Asn Arg Thr Pro Thr He Val Val Leu Trp Ser 

925 930 935 

GTG TTG TTG GCT TCT GTC TTC TCT CTT GTT TGG GTT CGG ATC AAC CCG 2941 
Val Leu Leu Ala Ser Val Phe Ser Leu Val Trp Val Arg He Asn Pro 
940 945 950 955 

TTT GTC AGC ACC GCC GAT AGC ACC ACC GTG TCA CAG AGC TGC ATT TCC 2989 
Phe -Val Ser Thr Ala Asp Ser Thr Thr Val Ser Gin Ser Cys He Ser 

960 965 970 

ATT GAT TGT TGATGATATT ATGTGTTTCT TAGAATTGAA ATCATTGCAA 3038 
He Asp Cys 

GTAAGTGGAC TGAAACATGT CTATTGACTA AGTTTTGAAC AGTTTGTACC CATTTTATTC 3098 
TTAGCAGTGT GTAATTTTCC TAAACAATGC TATGAACTAT ACATATTTCA TTGATATTTA 3158 
CATTAAATGA AACTACATCA GTCTGCAGAA AAAAAAAAAA AAAAAAAAA 3207 
[0 0 7 5] ffi^J#§: 2 h#ny— 

ffi£U(Dg2 : 974 mmcomm * 

mm 

Met Met Glu Ser Gly Val Pro Val Cys His Thr Cys Gly Glu His Val 

1 5 10 15 

Gly Leu Asn Val Asn Gly Glu Pro Phe Vai Ala Cys His Glu Cys Asn 

20 25 30 

Phe Pro He Cys Lys Ser Cys Phe Glu Tyr Asp Leu Lys Gin Gly Gin 

35 40 45 

Lys Ala Cys Leu Arg Cys Gly He Pro Tyr Asp Glu Asn Leu Leu Asp 

50 55 60 

Asp Val Glu Lys Ala Thr Gly Asp Gin Ser Thr Met Ala Ala His Leu 
65 70 75. 80 

Ser Lys Ser Gin Asp. Val Gly He His Ala Arg His lie Ser Ser Val 

85 90 95 

Ser Thr Leu Asp Ser Glu Met Thr Glu Asp Asn Gly Asn Pro He Trp 

1 00 105 HO 

Lys Asn Arg Val Glu Ser Trp Lys Glu Lys Lys Asn Lys Lys Lys Lys 

H5 120 125 

Pro Ala Thr Thr Lys Val Glu Arg Glu Ala Glu He Pro Pro Glu Gin 

130 135 140 

Gin Met Glu Asp Lys Pro Ala Pro Asp Ala Ser Gin Pro Leu Ser Thr 
145 150 155 160 

He He Pro He Pro Lys Ser Arg Leu Ala Pro Tyr Arg Thr Val He 

165 170 175 
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25 26 
He Met Arg Leu He lie ten Cly Leo Pbe The His Tyr Arg Val Tbr 

180 18$ 190 

Asn Pro Val Asp Ser Ala Phe Gly Lea Trp Leo Tar Ser Val lie Cys 

'95 200 205 

Glo lie Trp Phe Ala Phe Ser Trp Val Lea Asp Gla Phe Pro Lys Trp 

210 215 220 

Tyr Pro Val Asn Arg Gin Thr Tyr He Asp Arg Len Ser Ala Arg Tyr 
225 "0 235 240 

Gin Arg Gin Gly Gin Pro Asn Gin Len Ala Ala Val Asp Phe Phe Val 

245 250 255 

Ser Thr Val Asp Pro leu Lys Gin Pro Pro Len He Thr Ala Asn Thr 

260 265 2 7 o 

Val len Ser He Len Ala Len Asp Tyr Pro Val Asp Lys Val Ser Cys 

275 280 285 

Tyr lie Ser Asp Asp Gly Ala Ala Mel Len Thr Phe Gin Ser Len Val 
290 295 300 

Gin Thr Ala Asp Phe Ala Arg Lys Trp Val Pro Phe Cys Lys Lys Phe 

805 sin , , r 

JIU 315 320 

Ser lie Gin Pro Arg Ala Pro Gin Phe Tyr Phe Ser Gin lys lie Asp 

325 330 

Tyr Leu Lys Asp Lys Val Gin Pro Ser Phe Val Lys Gin Arg Arg Ala 

340 345 3so 

Mel lys Arg Asp Tyr Gin Gin Tyr Lys He Arg He Asn Ala Leu Val 

355 360 3 6 5 

Ala Lys Ala Gin Lys Thr Pro Glu Gin Gly Trp Thr Met Gin Asp Gly 

370 375 380 

Thr Pro Trp Pro Gly Asn Asn Pro Arg Asp His Pro Gly Met lie Gin 

385 390 395 400 

Val Phe Leu Gly Tyr Ser Gly Ala His Asp lie Glu Gly Asn Gin Leu 

405 HO 415 

Pro Arg Leu Val Tyr Val Ser Arg CIu lys Arg Pro Gly Tyr Gin His 

420 425 430 

His Lys Lys Ala Gly Ala Gin Asn Ala leu Val Arg Val Ser Ala Val 

435 440 445 

Len Thr Asn Ala Pro Phe He Leu Asn leu Asp Cys Asp His Tyr Val 

450 455 460 

Asn Asn Ser lys Ala Val Arg Gin Ala Met Cys Phe Len Met Asp Pro 
465 "70 47 5 4g0 

Gin Val Gly Arg Asp Val Cys Tyr Val Gin Phe Pro Gin Arg Phe Asp 

485 490 495 

Gly lie Asp Arg Ser Asp Arg Tyr Ala Asn Arg Asa Thr Val Phe Phe 

500 505 5, 0 

Asp Val Asa Met Lys Gly Len Asp Cly lie Gin Gly Pro Val Tyr Val 

515 520 525 

Gly Thr Gly Cys Val Phe Asn Arg Gin Ala Len Tyr Gly Tyr Gly Pro 

530 535 540 

Pro Ser Mel Pro Ser Phe Pro Lys Ser Ser Ser Ser Ser Cys Ser Cys 

545 550 555 560 

Cys Cys Pro Gly Lys Lys Glu Pro Lys Asp Pro Ser Gin Len Tyr Arg 

565 S 7o 575 
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27 28 
Asp Ala Lys Arg Glu Gla Lea Asp Ala Ala lie Phe Asn Leo Arg Glu 

580 585- 590 

lie Asp Asn Tyr Asp Glu Tyr Gla Arg Ser Met Lea lie Ser Gin Thr 

595 600 605 

Ser Phe Gla Lys Thr Phe Gly Leu Ser Ser Val Phe He Glu Ser Thr 

610 615 620 

Leu Met Gla Asn Gly Gly Val Ala Glu Ser Ala Asn Pro Ser Thr Leu 
625 630 635 640 

He Lys Gla Ala He His Val He Gly Cys Gly Tyr Glu Glu Lys Thr 

645 650 655 

Ala Trp Gly Lys Glu He Gly Trp lie Tyr Gly Ser Val Thr Glu Asp 

660 665 670 

He Leu Thr Gly Phe Lys Met His Cys Arg Gly Trp Arg Ser lie Tyr 

675 680 685 

Cys Met Pro Leu Arg Pro Ala Phe Lys Gly Ser Ala Pro He Asn Leu 

690 695 700 

Ser Asp Arg Leu His Gin Val Leu Arg Trp Ala Leu Gly Ser Val Glu 
705 710 715 720 

He Phe Leu Ser Arg His Cys Pro Leu Trp Tyr Gly Phe Gly Gly Gly 

725 730 735 

Arg Leu Lys Trp Leu Gin Arg Leu Ala Tyr He Asn Thr He Val Tyr 

740 745 750 

Pro Phe Thr Ser Leu Pro Leu He Ala Tyr Cys Ser Leu Pro Ala He 

755 760 765 

Cys Leu Leu Thr Gly Lys Phe He lie Pro Thr Leu Ser Asn Leu Ala 

770 775 780 

Ser Val Leu Phe Leu Gly Leu Phe Leu Ser He He Val Thr Ala Val 
785 790 795 800 

Leu Glu Leu Arg Trp Ser Gly Val Ser lie Glu Asp Leu Trp Arg Asn 

805 810 815 

Glu Gin Phe Trp Val He Gly Gly Val Ser Ala His Leu Phe Ala Val 

820 825 830 

Phe Gin Gly Phe Leu Lys Met Leu Ala Gly He Asp Thr Asn Phe Thr 

835 840 845 

Val Thr Ala Lys Ala Ala Asp Asp Ala Asp Phe Gly Glu Leu Tyr He 

850 855 860 

Val Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Leu Leu He Val 
865 870 875 880 

Asn Met Val Gly Val Val Ala Gly Phe Ser Asp Ala Leu Asn Lys Gly 

885 890 895 

Tyr Glu Ala Trp Gly Pro Leu Phe Gly Lys Val Phe Phe Ser Phe Trp 

900 905 910 

Val He Leu His Leu Tyr Pro Phe Leu Lys Gly Leu Met Gly Arg Glu 

915 920 925 

Asn Arg Thr Pro Thr lie Val Val Leu Trp Ser Val Leu Leu Ala Ser 

930 935 940 

Val Phe Ser Leu Yal Trp Val Arg He Asn Pro Phe Val Ser Thr Ala 
945 950 955 960 

Asp Ser Thr Thr Val Ser Gin Ser Cys He Ser He Asp Cys 

965 970 
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29 



[0 0 7 6] : 3 

: 3311 

GZPKDM : 

: 

h*D->*- : 

SJHOtft : cDNA io nRNA 



mm 



^Bf 1 0 
30 

: O ? (Gossypiun b i rso Ion I.) 
: CoXer3l2 

#8fc£:S-rSB*f : CDS 
#a&B : 23.. 3142 
#&£ft5gLJt#fi : S 
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CTTTCGTTCT TTTCGTTTTC CC ATG GCT TCA ACC ACC ATG GCC GCT CGC TTT 

Mel Ala Ser Thr Thr Mel Afa Ala Gly Phe 

1 5 10 

TCA CTT GCI GTT GAG AAT CGG GGA TCA TCG ACA CAT CAA TCA 
Gly Ser Leu Ala Val Asp Glo Asn Arg Gly Ser Ser Tar His Gin Ser 

15 20 25 

TCA ACG AAA ATA TGC AGG GTG TGT GGG GAT AAG ATC GGG CAA AAG GAA 
Ser Thr Lys lie Cys Arg Val Cys Gly Asp Lys Me Gly Gin Lys Glu 

30 35 40 

AAC GGA CAA CCG TTC GTG GCT TGT CAT GTC TGT GCT TTC CCC GTT TGC 
Asn Gly Gin Pro Phe Val Ala Cys His Val Cys Ala Phe Pro Val Cys 

45 50 55 

CGT CCT TGT TAT GAA TAT GAA AGG AGT CAA GGA AAC CAG TGC TGT CCT 
Arg Pro Cys Tyr Gin Tyr Glu Arg Ser Gin Gly Asn Glo Cys Cys Pro 

60 «5 70 

CAG TGC AAT ACT CGC TAT AAG CGT CAC AAA GGT AGT CCA AGA ATT TCA 
Gin Cys Asn Thr Arg Tyr Lys Arg His Lys Gly Ser Pro Arg Me Ser 

75 go tc 

85 90 

CGA GAT GAA CAA GAT GAT TCA GAT CAA GAT GAT TTT GAT GAT GAA TTT 
Gly Asp Gin Glu Asp Asp Ser Asp Glu Asp Asp Phe Asp Asp Glu Phe 

95 100 105 

CAG ATT AAG AAC CGC AAG GAT GAC TCC CAT CCA CAA CAT GAA AAT CAG 
Glu Me Lys Asd Arg Lys Asp Asp Ser His Pro Clu His Glu Asu Glu 

HO 115 no 

GAA TAT AAT AAT AAT AAT CAT CAA TGG CAT CCC AAT GGT CAA GCT TTC 
Glu Tyr Asn Asn Asn Asn His Gin Trp His Pro Asn Gly Gin Ala Phe 

125 130 ,35 

TCA GTT GCC GGA AGC ACG GCC GGG AAG CAT TTG GAA GGG GAT AAA GAG 
Ser Val Ala Gly Ser Thr Ala Gly Lys Asp Leu Glu Gly Asp Lys Glu 
"40 ,45 |50 

ATT TAC GGA AGC GAA GAA TGG AAA GAA AGA GTT CAG AAA TGG AAA GTC 
Me Tyr Gly Ser Clu Glu Trp Lys Glu Arg Val Glo Lys Trp Lys Val 
155 160 165 no 

ACG CAA GAA AAA AGA GGT TTG GTA AGC AAC CAT AAT GCC CCA AAT GAT 
Arg Gin Glu Lys Arg Gly Leu Val Ser Asn Asp Asn Gly Gly Asn Asp 

175 180 , g5 

CCT CCT GAA GAA CAT GAT TAT CTC TTG GCT GAA GCT CGC CAG CCT CTA 
Pro Pro Clu Gin Asp Asp Tyr Len Leu Ala Glu Ala Arg Gin Pro Leu 

190 "95 200 

TGG CGA AAA GTG CCA ATT TCC TCA AGT CTC ATA ACC CCT TAC CGG ATA 
Trp Arg Lys V.I Pro Me Ser Ser Ser Len Me Ser Pro Tyr Arg Me 
205 210 215 



52 



100 



148 



196 



244 



292 



340 



388 



436 



484 



532 



580 



628 



676 
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31 32 

GTC ATC GTC CTC CGA TTC TTC ATC CTC GCA TTT TTC CTC CGG TTC CGT 724 

Val lie Val Lea Arg Phe Phe lie Leu-Ala Phe Phe Leu Arg Phe Arg 

220 225 230 

ATT CTA ACA CCC GCC TAC GAC GCT TAC CCG TTA TGG CTA ATC TCT GTC 772 

He Leu Thr Pro Ala Tyr Asp Ala Tyr Pro Leu Trp Leu lie Ser Val 

235 240 245 250 

ATC TGC GAA GTT TGG TTC GCC TTC TCC TGG ATT CTC GAT CAG TTC CCT 820 

He Cys Glu Val Trp Phe Ala Phe Ser Trp Me Leu Asp Gin Phe Pro 

255 260 265 

AAA TGG TTC CCT ATT ACT CGC GAA ACT TAC CTC GAT CGC CTC TCC TTG *" 868 

Lys Trp Phe Pro He Thr Arg Glu Thr Tyr Leu Asp Arg Leu Ser Leu 

270 . 275 280 

AGG TTC GAA CGT GAA GGA GAG CCC AAT CAA CTT GGC CCC GTC GAC GTC 916 

Arg Phe Glu Arg Glu Gly Glu Pro Asn Gin Leu Gly Pro Val Asp Val 

285 290 295 

TTC GTC AGT ACC GTT GAC CTT CTC AAG GAA CCC CCC ATC ATA ACC GCC 964 

Phe Val Ser Thr Val Asp Leu Leu Lys Glu Pro Pro He lie Thr Ala 

300 305 310 

AAC GCG GTT CTA TCG ATC TTG GCC GTC GAT TAC CCG GTC GAG AAA GTG 1012 

Asn Ala Val Leu Ser He Leu Ala Val Asp Tyr Pro Val Glu Lys Val 

315 320 325 330 

TGT TGT TAT GTG TCG GAC GAT GGT GCT TCC ATG CTT CTT TTC GAT TCG 1060 

Cys Cys Tyr Val Ser Asp Asp Gly Ala Ser Met Leu Leu Phe Asp Ser 

335 340 345 

TTG TCT GAA ACG GCT GAG TTC GCG AGG AGA TGG GTT CCG TTT TGT AAG 1108 

Leu Ser Glu Thr Ala Glu Phe Ala Arg Arg Trp Val Pro Phe Cys Lys 

350 355 360 

AAG CAT AAT GTT GAG CCC AGG GCG CCG GAG TTT TAT TTC AAT GAG AAG 1156 

Lys His Asn Val Glu Pro Arg Ala Pro Glu Phe Tyr Phe Asn Glu Lys 

365 370 375 

ATT GAT TAT TTG AAG GAC AAG GTC CAT CCT AGC TTT GTT AAA GAA CGG 1204 

He Asp Tyr Leu Lys Asp Lys Val His Pro Ser Phe Val- Lys Glu Arg 

380 385 390 

AGA GCC ATG AAA AGG GAA TAT GAA GAA TTT AAA GTA AGG ATC AAT GCA 1252 

Arg Ala Mel Lys Arg Glu Tyr Glu Glu Phe Lys Val Arg He Asn Ala 

395 400 405 410 

TTA GTA GCA AAA GCT CAG AAG AAA CCA GAA GAA GGA TGG GTG ATG CAA 1300 

Leu Val Ala Lys Ala Gin Lys Lys Pr.o Glu Glu Gly Trp Val Met Gin 

415 420 425 

GAT GGC ACC CCA TGG CCC GGA AAT AAC ACT CGT GAT CAT CCT GGA ATG 1348 

Asp Gly Thr Pro Trp Pro Gly Asn Asn Thr Arg Asp His Pro Gly Met 

430 435 440 

ATT CAG GTC TAT CTA GGA AGT GCC GGT GCA CTC GAT GTG GAT GGC AAA 1396 

He Gin Val Tyr Leu Gly Ser Ala Gly Ala Leu Asp Val Asp Gly Lys 

445 450 455 

GAG CTG CCT CGA CTT GTC TAT GTT TCT CGT GAG AAA CGA CCT GGT TAT 1444 

Glu Leu Pro Arg Leu Val Tyr Val Ser Arg Glu Lys Arg Pro Gly Tyr 

460 465 470 

CAG CAC CAT AAG AAA GCC GGT GCT GAG AAT GCT CTG GTT CGA GTT TCT 1492 

Gin His His Lys Lys Ala Gly Ala Glu Asn Ala Leu Val. Arg Val Ser 
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475 

GCA GTG CTT ACT AAT 
Ala Val Leu Thr Asn 

495 

TAC ATC AAC AAT AGC 
Tyr He Asn Asn Ser 

510 

GAT CCT CAG TTT GGA 
Asp Pro Gin Phe Gly 

525 

TTT GAT GGT ATT GAT 
Phe Asp Gly He Asp 
540 

TTC TTT GAT ATC AAC 
Phe Phe Asp He Asn 
555 

TAT GTA GGC ACA GGG 
Tyr Val Gly Thr Gly 

575 

GAT CCA CCA GTC TCT 
Asp Pro Pro Val Ser 

590 

CCT TCT TGG TGT TGC 
Pro Ser Trp Cys Cys 
605 

AAG AAA GGT GAA AAG 
Lys Lys Gly Glu Lys 
620 

AAG AAG AAG ATG ATG 
Lys Lys Lys Met Met 
635 

GTC TTT GAT CTC GAA 
Val Phe Asp Leu Gin 

655 

TTG GAG AAA TCG ACA 
Leo Glu Lys Ser Thr 

670 

GGA CAA TCA CCG GTT 
Gly Gin Ser Pro Val 
685 

CTT CCT GAA GGA ACT 
Leu Pro Glu Gly Thr 
700 

GTA ATT AGC TGT GGT 
Val He Ser Cys Gly 
715 

GGA TGG ATT TAT GGG 
Gly Trp He Tyr Gly 

735 

ATG CAT TGT AGA GGG TGG AAA 
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480 

GCA CCC 
Ala Pro 

AAG GCC 
Lys Ala 

AAG AAG 
Lys Lys 

CGT CAT 
Arg His 
545 
ATG TTG 
Met Leu 
560 

TGT GTT 
Cys Val 

GAG AAA 
Glu Lys 

TGT TGT 

Cys Cys 

AAG GGC 
Lys Gly 
625 
GGC AAA 
Gly Lys 
640 

GAA ATC 
Glu He 



TTC ATA 
Phe He 

ATG AGG 
Met Arg 
515 
CTT TGT 
Leu Cys 
530 

GAT CGA 
Asp Arg 

GGA TTA 
Gly Len 

TTC AAC 
Phe Asn 

CGA CCA 
Arg Pro 
595 
TGC GGA 
Cys Gly 
610 

TTA CTC 
Leu Leo 

AAC TAT 
Asn Tyr 



GAA GAA 
Glu Gin 



TTA ATG 
Len Met 

TTC ATT 
Phe He 

AAT TCC 
Asn Ser 
705 
TAT GAA 
Tyr Gin 
720 

TCG GTG 
Ser Val 



TCG CAG 
Ser Gin 
675 
GCC TCA 
Ala Ser 
690 

ACA TCA 
Thr Ser 



485 
TTG AAT 
Leu Asn 
500 

GAA GCG 
Glu Ala 

TAT GTT 
Tyr Val 

TAT GCT 
Tyr Ala 

GAT GGA 
Asp Gly 
565 
AGG CAG 
Arg Gin 
580 

AAG ATG 
Lys Met 

GGT TCT 
Gly Ser 

GGA GGT 
Gly Gly 

GTG AAA 
Val Lys 
645 
GGG CTT 
Gly Leu 
660 

AAG AAT 
Lys Asn 



CTG GAT 
Leu Asp 



ATG TGC 
Met Cys 



CAA TTT 
Gin Phe 
5 35 
AAT CGA 
Asn Arg 
550 

CTT CAA 
Leu Gin 



GCA TTG 
Ala Leu 



ACA TGT 
Thr Cys 



AGG AAG 
Arg Lys 
615 
CTT TTA 
Leu Leu 
630 

AAA GGG 
Lys Gly 



GAA GGA 
Gin Gly 

TTC GAG 
Phe Gin 



ACT TTG 
Thr Leu 

CTG ATT 
Leu He 



ATG GAA 
Met Gia 
695 
AAA GAG 
Lys Glu 
710 

TGG GGC 
Trp Gly 



GAA AAA ACT GAG 
Gin Lys Thr Glu 

725 

ACG GAA GAT ATA TTA ACA 
Thr Gin Asp He Leu Thr 
740 

TCG GTT TAT TGT GTA CCG 



490 

TGT GAT CAT 
Cys Asp His 

505 
TTT TTA ATG 
Phe Leu Met 
520 

CCA CAG AGA 
Pro Gin Arg 

AAT GTT GTC 
Asn Val Val 

GGC CCT GTA 
Gly Pro Val 
570 

TAT GGC TAC 
Tyr Gly Tyr 

585 
GAT TGC TGG 
Asp Cys Trp 
600 

AAA TCA AAG 
Lys Ser Lys 

TAC GGA AAA 
Tyr Gly Lys 

TCT GCA CCA 
Ser Ala Pro 
650 

TAC GAA GAA 
Tyr Gin Gin 
665 

AAA CGA TTC 
Lys Arg Phe 
680 

AAT GGT GGC 
Asn Gly Gly 



GCC ATT CAC 
Ala lie His 

AAA GAG ATC 
Lys Glu He 
730 

GGT TTC AAG 
Gly Phe Lys 
745 

AAA AGA CCG 



1540 



1588 



1636 



1684 



1732 



1780 



1828 



1876 



1924 



1972 



2020 



2068 



2116 



2164 



2212 



2260 



2308 
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35 36 

Met His Cys Arg Gly Trp Lys Ser Yal Tyr Cys Val Pro Lys Arg Pro 

750 755 - 760 

GCA TTC AAA GGG TCC GCT CCA ATC AAT CTC TCG GAT CGG TTG CAC CAA 2356 

Ala Pbe Lys Gly Ser Ala Pro He Asn Lea Ser Asp Arg Leu His Gin 

765 770 775 

GTT TTG AGA TGG GCA CTT GGT TCT GTA GAA ATT TTC CTT ACT CGT CAC 2404 

Val Leo Arg Trp Ala Leo Gly Ser Val GIq lie Phe Lea Ser Arg His 

780 785 790 

TGT CCA CTT TGG TAT GGT TAT GGT GGA AAA CTG AAA TGG CTC GAG AGG 2452 

Cys Pro Leu Trp Tyr Gly Tyr Gly Gly Lys Leu Lys Trp Leu Glu Arg 

795 800 805 810 

CTT GCT TAT ATC AAC ACC ATT GTT TAC CCT TTC ACC TCG ATC CCT TTA 2500 

Leu Ala Tyr lie Asn Thr He Val Tyr Pro Phe Thr Ser He Pro Leu 

815 820 825 

CTC GCC TAT TGT ACT ATT CCA GCT GTT TGT CTT CTC ACC GGC AAA TTC 2548 

Leu Ala Tyr Cys Thr He Pro Ala Val Cys Leu Leu Tbr Gly Lys Phe 

830 835 840 

ATC ATT CCA ACT CTA AGC AAC CTT ACA AGT GTG TGG TTC TTG GCA CTT 2596 

He lie Pro Thr Leu Ser Asn Len Thr Ser Val Trp Pbe Leu Ala Leu 

845 850 855 

TTC CTC TCC ATC ATT GCA ACT GGA GTG CTT GAA CTT CGA TGG AGC GGG 2644 

Phe Leu Ser He He Ala Thr Gly Val Leu Glu Leu Arg Trp Ser Gly 

860 865 870 

GTT AGC ATC CAA GAC TGG TGG CGC AAT GAA CAA TTC TGG GTG ATC GGA 2692 

Val Ser He Gin Asp Trp Trp Arg Asn Glu Gin Phe Trp Val He Gly 

875 880 885 890 

GGT GTC TCC GCC CAT CTT TTT GCT GTC TTC CAG GGC CTC CTC AAA GTC 2740 

Gly Val Ser Ala His Leu Phe Ala Val Phe Gin Gly Leu Leu Lys Val 

895 900 905 

CTA GCT GGA GTA GAC ACC AAC TTC ACC GTA ACA GCA AAA GCA GCA GAC 2788 

Leu Ala Gly Val Asp Thr Asn Phe Thr Val Thr Ala Lys Ala Ala Asp 

910 915 920 

GAT ACA GAA TTC GGT GAA CTT TAT CTC TTC AAA TGG ACA ACT CTC TTA 2836 

Asp Thr Glu Phe Gly Gin Leo Tyr Len Phe Lys Trp Thr Thr Leu Leu 

925 930 935 

ATC CCT CCC ACA ACT CTG ATA ATA CTG AAC ATG GTC GGA GTC GTG GCC 2884 

He Pro Pro Thr Thr Leu He He Leu Asn Met Val Gly Val Val Ala 

940 945 950 

GGA GTT TCA GAC GCA ATC AAC AAC GGC TAT GGT TCA TGG GGT CCA TTG 2932 

Gly Val Ser Asp Ala He Asn Asn Gly Tyr Gly Ser Trp Gly Pro Leu 

955 960 965 970 

TTC GGC AAA CTG TTC TTC GCA TTC TGG GTC ATT CTT CAT CTT TAC CCA 2980 

Phe Gly Lys Leu Phe Phe Ala Phe Trp Val He Leu His Leu Tyr Pro 

975 980 985 

TTC CTC AAA GGT TTG ATG GGG AGA CAA AAC AGG ACG CCC ACC ATT GTT 3028 

Phe Leu Lys Gly Leu Met Gly. Arg Gin Asn Arg Tbr Pro Thr He Val 

990 995 1000 

GTG CTT TGG TCC ATA CTT TTG GCA TCG ATT TTC TCA CTG GTT TGG GTA 3076 

Val Leu Trp Ser He Leu Leu Ala Ser lie Phe Ser Leu Val Trp Val 
1005 1010 1015 
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37 38 
CGG ATC GAT CCC TTC TTG CCC AAA CAA ACA GGT CCA GTT CTT AAA CAA 3124 
Arg lie Asp Pro Phe Leu Pro Lys GlrThr Gly Pro Val Leu Lys Gin 

1 020 1025 1030 

TGT GGC GTG GAG TGC TAAATGGTGT TTTACAAACC TTTCTTATTA TTTTATTTTC 3179 
Cys Gly Val Glu Cys 
1035 

CCTTTTTGCC ACTACTGTTG ATTTGCTGTG ATTCTAAAAG GGATTTATCT TGTTTGTAAA 3239 
AAGTCTCCTA TGATTTTGTT GGTTCAATTT AATTTCTATA TGGTAAAAAA ATATTTCTTT 3299 
AAATTAACTA TA 33n 
[0 0 7 7 ] : 4 10 htf Di ;- : mmm 

bmoa* : 1039 amvmm z*>n*x 

mm<om : r 5 sm 

Met Ala Ser Thr Thr Met Ala Ala Gly Phe Gly Ser Leu Ala Val Asp 

! 5 10 15 

Glu Asn Arg Gly Ser Ser Thr His Gin Ser Ser Thr Lys He Cys Arg 

20 25 30 

Val Cys Gly Asp Lys lie Gly Gin Lys Glu Asn Gly Gin Pro Phe Val 

35 40 45 

Ala Cys His Val Cys Ala Phe Pro Val Cys Arg Pro Cys Tyr Glu Tyr 

50 55 60 

Glu Arg Ser Glu Gly Asn Gin Cys Cys Pro Gin Cys Asn Thr Arg Tyr 
65 ™ 75 80 

Lys Arg His Lys Gly Ser Pro Arg He Ser Gly Asp Gin Gin Asp Asp 

85 90 95 

Ser Asp Gin Asp Asp Phe Asp Asp Glu Phe Gin lie Lys Asn Arg Lys 

100 105 no 

Asp Asp Ser His Pro Gin His Glu Asn Glu Glu Tyr Asn Asn Asn Asn 

115 120 125 

His Gin Trp His Pro Asn Gly Gin Ala Phe Ser Val Ala Gly Ser Thr 

130 135 )40 

Ala Gly Lys Asp Leu Glu Gly Asp Lys Glu lie Tyr Gly Ser Glu Glu 
145 150 155 160 

Trp Lys Glu Arg Val Glu Lys Trp Lys Val Arg Gin Glu Lys Arg Gly 

165 170 175 

Leu Val Ser Asn Asp Asn Gly Gly Asn Asp Pro Pro Glu Glu Asp Asp 

180 185 190 

Tyr Leu Leu Ala Glu Ala Arg Gin Pro Leu Trp Arg Lys Val Pro lie 

I9 5 200 205 

Ser Ser Ser Leu He Ser Pro Tyr Arg He Val He Val Leu Arg Phe 

210 215 220 

Phe He Leu Ala Phe Phe Leu Arg Phe Arg He Leu Thr Pro Ala Tyr 
225 230 235 240 

Asp Ala Tyr Pro Leu Trp Leu He Ser Val lie Cys Glu Val Trp Phe 

245 250 255 

Ala Phe Ser Trp He Leu Asp Gin Phe Pro Lys Trp Phe Pro He Thr 

260 265 270 

Arg Glu Thr Tyr Leu Asp Arg Leu Ser Leu Arg Phe Glu Arg Glu Gly 

275 280 285 

Glu Pro Asn Gin Leu Gly Pro Val Asp Val Phe Val Ser Thr Val Asp 
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39 40 
290 . 295 300 

Lea Leo Lys Glu Pro Pro lie He Thr-Ala Asn Ala Val Leu Ser He 
305 310 315 320 

Leu Ala Val Asp Tyr Pro Val Glu Lys Val Cys Cys Tyr Val Ser Asp 

325 330 335 

Asp Gly Ala Ser Met Leu Leo Phe Asp Ser Leu Ser Glu Thr Ala Glu 

340 345 350 

Phe Ala Arg Arg Trp Val Pro Phe Cys Lys Lys His Asn Val Glu Pro. 

355 360 365 

Arg Ala Pro Giu Phe Tyr Phe Asn Gin Lys He Asp Tyr Leu Lys Asp 

370 375 380 

Lys Val His Pro Ser Phe Val Lys Glu Arg Arg Ala Mel Lys Arg Glu 
385 390 395 400 

Tyr Glu Gin Phe Lys Val Arg He Asn Ala Leu Val Ala Lys Ala Gin 

405 410 415 

Lys Lys Pro Glu Glu Gly Trp Val Met Gin Asp Gly Thr Pro Trp Pro 

420 425 430 

Gly Asn Asn Thr Arg Asp His Pro Gly Met He Gin Val Tyr Leu Gly 

435 440 445 

Ser Ala Gly Ala Leu Asp Val Asp Gly Lys Glu Leu Pro Arg Leu Val 

450 455 460 

Tyr Val Ser Arg Glu Lys Arg Pro Gly Tyr Gin His His Lys Lys Ala 
465 470 475 480 

Gly Ala Glu Asn Ala Leu Val Arg Val Ser Ala Val Leu Thr Asn Ala 

485 490 495 

Pro Phe He Leu Asn Leu Asp Cys Asp His Tyr lie Asn Asn Ser Lys 

500 505 510 

Ala Met Arg Giu Ala Mel Cys Phe Leu Met Asp Pro Gin Phe Gly Lys 

515 520 525 

Lys Leu Cys Tyr Yal Gin Phe Pro Gin Arg Phe Asp Gly He Asp Arg 

530 535 540 

His Asp Arg Tyr Ala Asn Arg Asn Val Val Phe Phe Asp He Asn Met 
545 550 555 560 

Leu Gly Leu Asp Gly Leu Gin Gly Pro Val Tyr Val Gly Thr Gly Cys 

565 570 575 

Val Phe Asn Arg Gin Ala Leu Tyr Gly Tyr Asp Pro Pro Val Ser Glu 

580 585 590 

Lys Arg Pro Lys Met Thr Cys Asp Cys Trp Pro Ser Trp Cys Cys Cys 

595 600 605 

Cys Cys Gly Gly Ser Arg Lys Lys Ser Lys Lys Lys Gly Glu Lys Lys 

610 615 620 

Gly Leu Leu Gly Gly Leu Leu Tyr Gly Lys Lys Lys Lys Met Met Gly 
625 630 635 640 

Lys Asn Tyr Val Lys Lys Gly Ser Ala Pro Val Phe Asp Leu Glu Glu 

645 650 655 

He Glu Glu Gly Leu Glu Gly Tyr Glu Glu Leu Glu Lys Ser Thr Leu 

660 665 670 

Met Ser Gin Lys Asn Phe Glu Lys Arg Phe Gly Gin Ser Pro Val Phe 

675 680 685 

He Ala Ser Thr Leu Met Glu Asn Gly Gly Leu Pro Glu Gly Thr Asn 
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41 42 
690 695 700 

Ser Tbr Ser Len He Lys Gin Ala Ile-His Val He Ser Cys Gly Tyr 
705 710 715 720 

Glu Gin Lys Thr Gin Trp Gly Lys Glo He Gly Trp He Tyr Gly Ser 

72 * 730 735 

Val Thr Glu Asp He Len Thr Gly Phe Lys Met His Cys Arg Gly Trp 

7 *0 745 750 

Lys Ser Val Tyr Cys Val Pro Lys Arg Pro Ala Phe Lys Gly Ser Ala 

755 760 765 

Pro He Asn Leu Ser Asp Arg Leu His Gin Val Leu Arg Trp Ala Leu 

770 775 780 

Gly Ser Val Glu He Phe Len Ser Arg His Cys Pro Leu Trp Tyr Gly 
785 ™0 795 goo 

Tyr Gly Gly Lys Len Lys Trp Leu Glu Arg Leu Ala Tyr He Asn Thr 

805 810 815 

He Val Tyr Pro Phe Thr Ser He Pro Leu Leu Ala Tyr Cys Thr He 

820 825 830 

Pro Ala Val Cys Leu Len Thr Gly Lys Phe He He Pro Thr Leu Ser 

8 35 840 845 

Asn Len Thr Ser Val Trp Phe Leu Ala Len Phe Len Ser He He Ala 

850 855 860 

Thr Gly Val Leu Gin Len Arg Trp Ser Gly Val Ser He Gin Asp Trp 
865 87 ° 875 880 

Trp Arg Asn Glu Gin Phe Trp Val He Gly Gly Val Ser Ala His Len 

885 890 895 

Phe Ala Val Phe Gin Gly Len Leu Lys Val Leu Ala Gly Val Asp Thr 

500 905 910 

Asn Phe Thr Val Thr Ala Lys Ala Ala Asp Asp Thr Gin Phe Gly Glu 

915 920 925 

Leu Tyr Leu Phe Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Leu 

930 935 9 4 o 

He He Leu Asn Met Val Gly Val Val Ala Gly Val Ser Asp Ala He 
945 950 955 960 

Asn Asn Gly Tyr Gly Ser Trp Gly Pro Len Phe Gly Lys Leu Phe Phe 

965 970 975 

Ala Phe Trp Val He Leu His Leu Tyr Pro Phe Len Lys Gly Len. Met 

980 985 990 

Gly Arg Gin Asn Arg Tbr Pro Thr He Val Val Leu Trp Ser He Leu 

995 1000 |005 

Leu Ala Ser He Phe Ser Leu Val Trp Val Arg He Asp Pro Phe Leu 

1010 1015 1020 

Pro Lys Gin Thr Gly Pro Val Leu Lys Gin Cys Gly Val Gin Cys 
1025 1030 |035 

[0 0 7 8] mmmn : 5 : 9 , (GossypiDB hirsntDn 
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43 44 

CCG ACA TTC GTG AAG GAG CGT CGA GCT-ATG AAG AGA GAA TAT GAA GAA 48 

Pro Thr Pbe Val Lys Gin Arg Arg Ala Mel Lys Arg Glu Tyr Glu Glu 

1 5 10 15 

TTC AAG GTT AGG ATA AAT GCA CTT GTA GCC AAA GCC CAA AAG GTT CCT 96 

Phe Lys Val Arg lie Asa Ala Leu Val Ala Lys Ala Gla Lys Val Pro 

20 25 30 

CCA GAA GGG TGG ATC ATG CAA GAT GGG ACA CCA TGG CCA GGA AAC AAT 144 

Pro Glo Gly Trp lie Met Gin Asp G ly Thr Pro Trp Pro Gly Asn Asn 

35 40 45 

ACT AAA GAT CAC CCT GGT ATG ATT CAA GTA TTT CTC GGT CAA ACT GGA 192 

Thr Lys Asp His Pro Gly Met He Gin Val Phe Leu Gly Gin Ser Gly 

50 55 60 

GGC CAT GAT ACC GAA GGA AAT GAG CTT CCT CGT CTC GTC TAT GTA TCT 240 

Gly His Asp Thr Glu Gly Asn Glu Leu Pro Arg Leu Val Tyr Val Ser 

65 70 75 80 

CGA GAG AAA AGG CCT GGT TTC TTG CAT CAC AAG AAA GCT GGT GCC ATG 288 

Arg Glu Lys Arg Pro Gly Phe Leu His His Lys Lys Ala Gly Ala Met 

85 90 95 

AAC GCC CTT GTT CGG GTC TCG GGG GTG CTC ACA AAT GCT CCT TTT ATG 336 

Asn Ala Leu Val Arg Val Ser Gly Val Leu Thr Asn Ala Pro Phe Met 

1 00 105 no 

TTG AAC TTG GAT TGT GAC CAT TAT TTA AAT AAC AGC AAG GCT GTA AGA 384 

Leu Asn Leu Asp Cys Asp His Tyr Leu Asn Asn Ser Lys Ala Val Arg 

115 120 125 

GAG GCT ATG TGT TTC TTG ATG GAC CCT CAA ATT GGA AGA AAG GTT TGC 432 

Glu Ala Met Cys Phe Leu Met Asp Pro Gin lie Gly Arg Lys Val Cys 

130 135 140 

TAT GTC CAA TTC CCT CAA CGT TTC GAT GGT ATT GAT AGA CAT GAT CGA 480 

Tyr Val Gin Phe Pro Gin Arg Phe Asp Gly lie Asp Arg His Asp Arg 

145 150 155 160 

TAT GCC AAT CGG AAC ACA GTT TTC TTT GAT ATT AAC ATG AAA GGT CTA 528 

Tyr Ala Asn Arg Asn Thr Val Phe Phe Asp lie Asn Met Lys Gly Leu 

165 170 175 

GAT GGT ATA CAA GGC CCT GTA TAT GTC GGC ACG GGG TGT GTT TTC AGA 576 

Asp Gly He Gin Gly Pro Val Tyr Val Gly Thr Gly Cys Val Phe Arg 

180 185 190 

AGG CAA GCT CTT TAT GGT TAT GAA CCT CCA AAG GGA CCT AAG CGC CCG 624 

Arg Gin Ala Leu Tyr Gly Tyr Glu Pro Pro Lys Gly Pro Lys Arg Pro 

195 200 205 

AAA ATG GTA ACC TGT GGT TGC TGC CCT TGT TTT GGA CGC CGC AGA AAG 672 

Lys Met Val Thr Cys Gly Cys Cys Pro Cys Phe Gly Arg Arg Arg Lys 

210 215 220 

GAC AAA AAG CAC TCT AAG GAT GGT GGA AAT GCA AAT GGT CTA AGC CTA 720 

Asp Lys Lys His Ser Lys Asp Gly Gly Asn Ala Asn Gly Leu Ser Leu 

225 230 235 240 

GAA GCA GCC AAA GAT GAC AAG GAG TTA TTG ATG TCC CAC ATG AAC TTT 768 

Glu Ala Ala Lys Asp Asp Lys Glu Leu Leu Met Ser His Met Asn Phe 

245 ' 250 255 

GAA AAG AAA TTT GGA CAA TCA GCC ATT TTT GTA ACT TCA ACA CTG ATG 816 



< 24 > WmW- 10-276782 

45 46 
Glo Lys Lys Phe Gly Gin Ser Ala He Pbe Val Thr Ser Thr Leu Net 

260 265- 270 

GAA CAA GGT GGT GTC CCT CCT TCT TCA AGC CCC GCA GCT TTG CTC AAA 864 
Glo Gin Gly Gly Val Pro Pro Ser Ser Ser Pro Ala Ala Leu Leu Lys 

275 280 285 

GAA GCC ATT CAT GTA ATT AGT TGT GGT TAT GAA GAC AAA ACA GAA TGG 912 
Glu Ala He His Val He Ser Cys Gly Tyr Glu Asp Lys Thr GIu Trp 

290 295 300 

GGA AGC GAG CTT GGC TGG ATT TAC GGC TCG ATT ACA GAA GAT ATC TTA 960 
Gly Ser Glo Leu Gly Trp He Tyr Gly Ser He Tbr Glu Asp He Leu 
305 310 315 320 

ACA GGA TTC AAG ATG CAT TGC CGT GGA TGG AGA TCA ATA TAC TGC ATG 1008 
Thr Gly Phe Lys Mel His Cys Arg Gly Trp Arg Ser He Tyr Cys Met 

325 330 335 

CCA AAG TTG CCT GCA TTC AAG GGT TCA GCT CCC ATC AAT CTA TCG GAT 1056 
Pro Lys Leu Pro Ala Phe Lys Gly Ser Ala Pro He Asn Leu Ser Asp 

340 345 350 

CGT CTA AAC CAA GTC CTT CGA TGG GCA CTC GGT TCT GTT GAA ATT TTC 1104 
Arg Leo Asn Gin Val Leu Arg Trp Ala Leo Gly Ser Val Glu He Phe 

355 360 365 

TTT AGT CAT CAT TGC CCA GCA TGG TAT GGT TTC AAG GGA GGA AAG CTA 1152 
Phe Ser His His Cys Pro Ala Trp Tyr Gly Phe Lys Gly Gly Lys Leu 

370 375 380 

AAA TGG CTT GAA CGA TTC GCA TAT GTC AAC ACA ACC ATC TAC CCC TTC 1200 
Lys Trp Leu Glu Arg Phe Ala Tyr Val Asn Thr Thr He Tyr Pro Phe 
385 390 395 400 

ACA TCT TTA CCA CTT CTC GCC TAT TGT ACC CTA CCG GCA ATC TGT TTA 1248 
Thr Ser Leu Pro Leu Leu Ala Tyr Cys Thr Leu Pro Ala He Cys Leu 

405 410 415 

CTT ACC GAT AAA TTT ATC ATG CCA CCG ATA AGC ACC TTT GCA AGT CTA 1296 
Leu Thr Asp Lys Phe He Met Pro Pro He Ser Thr Phe Ala Ser Leu 

420 425 430 

TTC TTC ATT GCC TTG TTT CTT TCA ATC TTT GCA ACT GGT ATT CTC GAG 1344 
Phe Phe He Ala Leu Phe Leu Ser He Phe Ala Thr Gly He Leu Glu 

435 440 445 

CTA AGG TGG AGT GGA GTA AGC ATT GAA GAA TGG TGG AGG AAT GAG CAA 1392 
Leu Arg Trp Ser Gly Val Ser He Glu Glu Trp Trp Arg Asn Glu Gin 

450 455 460 

TTT TGG GTC ATC GGT GGC ATT TCG GCA CAT TTG TTC GCT GTT ATC CAA 1440 
Phe Trp Val He Gly Gly He Ser Ala His Leu Phe Ala Val He Gin 
465 470 475 480 

GGC TTG TTG AAA GTT CTA GCT GGT ATT GAC ACT AAT TTC ACT GTC ACA 1488 
Gly Leu Leu Lys Val Leu Ala Gly He Asp Thr Asn Phe Thr Val Thr 

485 490 495 

TCC AAG GCA ACT GAT GAC GAG GAG TTC GGG GAA TTG TAT ACT TTC AAA 1536 
Ser Lys Ala Thr Asp Asp Glu Glu Phe Gly Glu Leu Tyr Thr Phe Lys 

500 505 510 

TGG ACA ACC CTT CTA ATT CCT CCT ACT ACC GTC TTA ATC ATC AAT TTA 1584 
Trp Tbr Thr Leu Leu He Pro Pro Thr Thr Val Leu He He Asn Leu 
515 520 525 
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47 48 

GTC GGT GTC GTT GCA GGC ATC TCG GAT GCC ATA AAC AAT GGA TAC CAA 1632 

Val Gly Val Val Ala Gly He Ser Asp-Ala He Asn Asn Gly Tyr Gin 

530 535 540 

TCA TGG GGA CCT CTT TTT GGG AAG CTC TTC TTC TCT TTC TGG GTG ATT. 1680 

Ser Trp Gly Pro Leo Phe Gly Lys Leo Phe ( Phe Ser Phe Trp Val lie 

545 550 555 560 

GTC CAT CTC TAT CCA TTC CTC AAA GGT TTA ATG GGG AGA CAA AAC CGG . 1728 

Val His Leu Tyr Pro Phe Lea Lys Gly Leb Met Gly Arg Gin Asn Arg 

565 570 575 

ACA CCA ACC ATT GTT GTT ATA TGG TCA GTG CTA TTG GCT TCA ATC TTC 1776 

Thr Pro Thr He Val Val He Trp Ser Val Leu Leo Ala Ser lie Phe 

580 585 590 

TCC TTG CTT TGG GTC CGA ATT GAT CCA TTT GTG ATG AAA ACC AAA GGA 1824 

Ser Leu Leu Trp Val Arg He Asp Pro Phe Val Met Lys Thr Lys Gly 

595 600 605 

CCA GAC ACT ACA ATG TGT GGC ATT AAC TGT TGAAAAAAAA TCATCTTGCG 1874 

Pro Asp Thr Thr Met Cys Gly He Asn Cys 

610 615 
TGGTTCTTTT AGATTATGGT ATGTGATGTA TGAACAAACA AGAATGGAGA TGCACAAGAC 1934 
AGAATAAAAT TAGAGTGAAA GTTTTGTGTA GTTATATATT CATTCTACCA ACTATAAGTT 1994 

TTGTCATTCA ATTGAAAATA GCTCAACTTT GTGATCAAA 2033 . 

[ 0 0 7 9 ] 6 h # n i-?- : ItSitt 

SB^J^>^$:618 &&\<Z>mm : H 
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mm 

Pro Thr Phe Val Lys Gin Arg Arg Ala Met Lys Arg Glu Tyr Glu Glu 

15 10 15 

Phe Lys Val Arg He Asn Ala Leu Val Ala Lys Ala Gin Lys Val Pro 

20 25 30 

Pro Glu Gly Trp He Met Gin Asp Gly Thr Pro Trp Pro Gly Asn Asn 

35 40 45 

Thr Lys Asp His Pro Gly Met He Gin Val Phe Leu Gly Gin Ser Gly 

50 55 60 

Gly His Asp Thr Glu Gly Asn Glu Leu Pro Arg Leu Val Tyr Val Ser 

65 70 75 80 

Arg Glu Lys Arg Pro Gly Phe Leu His His Lys Lys Ala Gly Ala Met 

85 90 95 

Asn Ala Leu Val Arg Val Ser Gly Val Leu Thr Asn Ala Pro Phe Met 

100 105 110 

Leu Asn Leu Asp Cys Asp His Tyr Leu Asn Asn Ser Lys Ala Val Arg 

115 120 125 

Glu Ala Met Cys Phe Leu Met Asp Pro Gin He Gly Arg Lys Val Cys 

130 135 140 

Tyr Val Gin Phe Pro Gin Arg Phe Asp Gly He Asp Arg His Asp Arg 

145 150 155 160 

Tyr Ala Asn Arg Asn Thr Val Phe Phe Asp He Asn Met Lys Gly Leu 

165 170 175 

Asp Gly He Gin Gly Pro Val Tyr Val Gly Thr Gly Cys Val Phe Arg 

180 185 190 

Arg Gin Ala Leo Tyr Gly Tyr Glu Pro Pro Lys Gly Pro Lys Arg Pro 
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49 50 
195 200 205 

Lys Net Val Thr Cys Gly Cys Cys Pro Cys Pbe Gly Arg Arg Arg Lys 

210 215 220 

Asp Lys Lys His Ser Lys Asp Gly Gly Asn Ala Asn Gly Leo Ser Leu 
225 230 235 240 

Glu Ala Ala Lys Asp Asp Lys Gin Leu Leu Met Ser His Met Asn Phe 

245 250 255 

Glu Lys Lys Phe Gly Gin Ser Ala lie Phe Val Thr Ser Thr Leu Met 

260 265 270 

Glu Gin Gly Gly Val Pro Pro Ser Ser Ser Pro Ala Ala Leu Leu Lys 

275 280 285 

Glu Ala He His Val Me Ser Cys Gly Tyr Glu Asp Lys Thr Glu Trp 

2 90 295 300 

Gly Ser Glu Leu Gly Trp He Tyr Gly Ser He Thr Glu Asp He Leu 
305 310 3 1 5 320 

Tbr Gly Phe Lys Met His Cys Arg Gly Trp Arg Ser He Tyr Cys Met 

325 330 335 

Pro Lys Leu Pro Ala Phe Lys Gly Ser Ala Pro He Asn Leu Ser Asp 

340 345 350 

Arg Leu Asn Gin Val Leu Arg Trp Ala Leu Gly Ser Val Glu He Phe 

355 360 365 

Phe Ser His His Cys Pro Ala Trp Tyr Gly Phe Lys Gly Gly Lys Leu 

370 375 380 

Lys Trp Leu Glu Arg Phe Ala Tyr Val Asn Thr Thr He Tyr Pro Phe 
385 390 395 400 

Thr Ser Leu Pro Leu Leu Ala Tyr Cys Thr Leu Pro Ala Me Cys Leu 

405 410 415 

Leu Thr Asp Lys Phe He Met Pro Pro He Ser Thr Phe Ala Ser Leu 

420 425 430 

Phe Phe He Ala Leu Phe Leu Ser He Phe Ala Thr Gly He Leu Glu 

435 440 445 

Leu Arg Trp Ser Gly Val Ser He Glu Glu Trp Trp Arg Asn Glu Gin 

450 455 460 

Phe Trp Val He Gly Gly He Ser Ala His Leu Phe Ala Val He Gin 
465 470 475 480 

Gly Leu Leu Lys Val Leu Ala Gly He Asp Thr Asn Phe Thr Val Thr 

485 490 495 

Ser Lys Ala Thr Asp Asp Glu Glu Phe Gly Glu Leu Tyr Thr Phe Lys 

500 505 510 

Trp Thr Thr Leu Leu He Pro Pro Thr Thr Val Leu He He Asn Leu 

515 520 525 

Val Gly Val Val Ala Gly He Ser Asp Ala He Asn Asn Gly Tyr Gin 

530 535 540 

Ser Trp Gly Pro Leu Phe Gly Lys Leu Phe Phe Ser Phe Trp Val He 
545 550 555 560 

Val His Leu Tyr Pro Phe Leu Lys Gly Leu Met Gly Arg Gin Asn Arg 

565 570 575 

Thr Pro Thr He Val Val He Trp Ser Val Leu Leu Ala Ser He Phe 

580 585 590 

Ser Leu Leu Trp Val Arg lie Asp Pro Phe Val Met Lys Thr Lys Gly 
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595 600 605 

Pro Asp Thr Tbr Met Cys Gly lie Asn Cys 
610 615 
] E^J#*t : 7 
: 1086 

: cDNA to dRNA 

10 

mm 

GGCACGAGCT TTCATATCCT CCA ATG GAA GCC AGC GCC GGA CTC GTT GCG 

Net Gin Ala Ser Ala Gly Lea Val Ala 

1 5 
GGC TCT CAC AAC CGC AAT GAA CTT GTT GTC ATT CAT GGC CAT GAA GAG 
Gly Ser His Asn Arg Asn Gin Len Val Val lie His Gly His Gin Gin 
10 15 20 25 

CCT AAA CCT CTG AAG AAC TTG GAT GGT CAA GTT TGT GAG ATT TGT GGT 
Pro Lys Pro Leu Lys Asn Len Asp Gly Gin Val Cys Gin lie Cys Gly 

30 35 40 

GAT GAA ATT GGG TTG ACG GTC GAT GGA GAT CTT TTC GTG GCC TGC AAC 
Asp Gin He Gly Len Thr Val Asp Gly Asp Len Phe Val Ala Cys Asn 

45 50 55 

GAG TGT GGT TTT CCA GTT TGT AGG CCT TGT TAT GAG TAT GAA AGG AGA 
Gin Cys Gly Phe Pro Val Cys Arg Pro Cys Tyr Glu Tyr Glu Arg Arg 
60 65 70 

- GAA GGG ACT CAA CAA TGT CCT CAA TGC AAA ACT AGA TAC AAG CGT CTC 
Gin Gly Ser Gin Gin Cys Pro Gin Cys Lys Thr Arg Tyr Lys Arg Len 

75 80 85 

AAG GGG AGT CCG AGG GTG GAG GGA GAT GAA GAT GAA GAG GAT GTG GAT 
Lys Gly Ser Pro Arg Val Gin Gly Asp Glu Asp Glu Glu Asp Val Asp 
90 95 100 105 

GAT ATC GAA CAT GAA TTC AAC ATT GAT GAT GAA CAA AAC AAG TAT AGA 
Asp lie Gin His Gin Phe Asn He Asp Asp Glu Gin Asn Lys Tyr Arg 

110 115 120 

AAT ATC GCT GAA TCG ATG CTT CAT GGA AAG ATG AGC TAC GGG AGA GGC 
Asn He Ala Glu Ser Mel Leu His Gly Lys Met Ser Tyr Gly Arg Gly 

125 130 135 

CCT GAA GAC GAT GAA GGT TTG CAA ATC CCA CCC GGT TTA GCT GGT GTT 
Pro Glu Asp Asp Glu Gly Leu Gin He Pro Pro Gly Leu Ala Gly Val 

140 145 150 

CGA TCT CGG CCG GTG AGC GGG GAG TTC CCA ATA GGA AGC TCT CTT GCT 
Arg Ser Arg Pro Val Ser Gly Glu Phe Pro He Gly Ser Ser Leu Ala 

155 160 165 

TAT GGG GAA CAC ATG TCA AAT AAA CGA GTT CAT CCA TAT CCT ATG TCT 
Tyr Gly Glu His Met Ser Asn Lys Arg Val His Pro Tyr Pro Met Ser 
170 175 180 185 

GAA CCT GGA AGT GCA AGA TGG GAT GAA AAG AAA GAG GGA GGA TGG AGA 
Gin Pro Gly Ser Ala Arg Trp Asp Gin Lys Lys Gin Gly Gly Trp Arg 

190 195 200 
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386 



434 



482 



530 



578 



626 
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53 54 
GAA AGG ATG GAT GAT TGG AAA ATG CAG CAA GGG AAT TTG GGT CCT GAA 674 
Glu Arg Met Asp Asp Trp Lys Met Gln-Gln Gly Asn Leo Gly Pro Glu 

205 210 215 

CCT GAT GAT GCC TAT GAT GCT GAC ATG GCT ATG CTT GAT GAA GCT AGG 722 
Pro Asp Asp Ala Tyr Asp Ala Asp Met Ala Met Leo Asp Glo Ala Arg 

220 225 230 

CAG CCA TTG TCA AGG AAA GTG CCA ATT GCA TCG AGC AAA ATC AAT CCT 770 
Gin Pro Leu Ser Arg Lys Val Pro He Ala Ser Ser Lys He Asn Pro 

235 240 245 

TAT CGT ATG GTG ATT GTG GCT CGT CTA GTT ATC C4T GCT TTC TTT CTT 818 
Tyr Arg Met Val He Vat Ala Arg Leu Val He Leo Ala Phe Phe Leo 
250 255 260 265 

CGC TAT CGG ATT TTG AAC CCG GTA CAT GAT GCA ATT GGG CTT TGG CTA 866 
Arg Tyr Arg He Leu Asn Pro Val His Asp Ala He Gly Leo Trp Leo 

270 275 280 

ACT TCT GTG ATC TGT GAA ATC TGG TTT GCC TTT TCA TGG ATC CTT GAT 914 
Thr Ser Val He Cys Glo He Trp Phe Ala Phe Ser Trp He Leo Asp 

285 290 295 

CAG TTC CCT AAA TGG TTC CCT ATT GAC CGC GAG ACG TAT CTC GAT CGC 962 
Gin Phe Pro Lys Trp Phe Pro He Asp Arg Glo Thr Tyr Leo Asp Arg 

300 305 310 

CTT TCC CTC AGG TAT GAG AGG GAA GGT GAG CCC AAC ATG CTT GCT TCT 1010 
Leu Ser Leo Arg Tyr Glu Arg Glo Gly Glo Pro Asn Met Leo Ala Ser 

315 320 325 

GTT GAT ATT TTT GTC AGT ACA GTG GAT CCA TTG AAG GGA CCT CCT CTA 1058 
Val Asp lie Phe Val Ser Thr Val Asp Pro Leo Lys Gly Pro Pro Leu 
330 335 340 345 

GTA ACA GCG AAT ACA GTT CTA TCG ATC T i086 
Val Thr Ala Asn Thr Val Leu Ser He 

350 

to o 8 i j mmm^ : 8 h^py- : mmvt 

K8IcO*£ : 354 &WV>mm : 179- H 

mm<Dm: 7Z;& 7<?>f*> hM : N-teninal 

mm 

Met Glu Ala Ser Ala Gly Leu Val Ala Gly Ser His Asn Arg Asn Glo 

15 10 15 

Leo Val Val He His Gly His Glo Glo Pro Lys Pro Leo Lys Asn Leo 

20 25 30 

Asp Gly Gin Val Cys Glo He Cys Gly Asp Glu He Gly Leo Thr Val 

35 40 45 

Asp Gly Asp Leu Phe Val Ala Cys Asn Glu Cys Gly Phe Pro Val Cys 

50 55 60 

Arg Pro Cys Tyr Glu Tyr Glu Arg Arg Glu Gly Ser Gin Gin Cys Pro 
65 70 75 80 

Gin Cys Lys Thr Arg Tyr Lys Arg Leu Lys Gly Ser Pro Arg Val Glu 

85 90 95 

Gly Asp Glu Asp Glu Glu Asp Val Asp Asp He Glu His Glu Phe Asn 

100 105 l i 0 

He Asp Asp Gin Gin Asn Lys Tyr Arg Asn He Ala Gin Ser Met Leo 
115 120 125 



( 29 ) &m¥- 10-276782 

55 56 
His Gly Lys Met Ser Tyr Gly Arg Gly Pro Glu Asp Asp Gin Gly Leu 

130 135 - 140 

Gin He Pro Pro Gly Lea Ala Gly Val Arg Ser Arg Pro Val Ser Gly 
145 150 155 160 

Gin Phe Pro He Giy Ser Ser Leu Ala Tyr Gly Glu His Met Ser Asn 

165 170 175 

Lys Arg Val His Pro Tyr Pro Met Ser Glu Pro Gly Ser Ala Arg Trp 

180 185 190 

Asp Glu Lys Lys Glu Gly Gly Trp Arg Glu Arg Met Asp Asp Trp Lys 

195 200 205 

Met Gin Gin Gly Asn Leu Gly Pro Glu Pro Asp Asp Ala Tyr Asp Ala 

210 215 220 

Asp Met Ala Met Leu Asp Glu Ala Arg Gin Pro Leu Ser Arg Lys Val 
225 230 235 240 

Pro He Ala Ser Ser Lys He Asn Pro Tyr Arg Met Val He Val Ala 

245 250 255 

Arg Leu Val He Leu Ala Phe Phe Leu Arg Tyr Arg lie Leu Asn Pro 

260 265 270 

Val His Asp Ala He Gly Leu Trp Leu Thr Ser Val He Cys Glu He 

275 280 285 

Trp Phe Ala Phe Ser Trp lie Leu Asp Gin Phe Pro Lys Trp Phe Pro 

290 295 300 

Me Asp Arg Glu Thr Tyr Leu Asp Arg Leu Ser Leu Arg Tyr Glu Arg 
305 310 315 320 

Gin Gly Glu Pro Asn Met Leu Ala Ser Val Asp He Phe Val Ser Thr 

325 330 335 

Val Asp Pro Leu Lys Gly Pro Pro Leu Val Thr Ala Asn Thr Val Leu 

340 345 350 

Ser He 

[ 0 0 8 2 ] m&mn : 9 : (Gossypiura hirsutum L.) 

gB^JcD^^ : 1000 : Coker312 

m<D& : #&£3c-rie^ : cds 

h^Dy*- : \g.mVt : 1.. 1000 

IB^JOli : cDNA to nRNA ftm&fcfelsfzJjm : S 

mm 

GAC AAA GTC CGG CCG ACA TTC GTG AAG GAG CGT CGA GCT ATG AAG AGA 48 

Asp Lys Val Arg Pro Thr Phe Val Lys Glu Arg Arg Ala Met Lys Arg 

15 10 15 

GAA TAT GAA GAA TTC AAG GTT AGG ATA AAT GCA CTT GTA GCC AAA GCC 96 

Glu Tyr Glu Glu Pbe Lys Val Arg He Asn Ala Leu Val Ala Lys Ala 

20 25 30 

CAA AAG GTT CCT CCA GAA GGG TGG ATC ATG CAA GAT GGG ACA CCA TGG 144 

Gin Lys Val Pro Pro Glu Gly Trp He Met Gin Asp Gly Thr Pro Trp 

35 40 45 

CCA GGA AAC AAT ACT AAA GAT CAC CCT GGT ATG ATT CAA GTA TTT CTC 192 

Pro Gly Asn Asn Thr Lys Asp His Pro Gly Met He Gin Val Phe Leu 

50 55 60 



* 



288 



336 



384 
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57 58 
GGT CAA ACT GGA GGC CAT GAT ACC GAA GGA AAT GAG CTT CCT CGT CTC 240 
Gly Gin Ser Gly Gly His Asp Thr Glu Gly Asn Glo Leu Pro Arg Leo 
65 ™ 75 80 

GTC TAT GTA TCT CGA GAG AAA AGG CCA GGT TTC TTG CAT CAC AAG AAA 
Val Tyr Vai Ser Arg Glu Lys Arg Pro Gly Phe Leu His His Lys Lys 

85 90 95 

GCT GGT GCC ATG AAC GCC CTT GTT CGT GTC TCG GGG GTG CTT ACA AAT 
Ala Gly Ala Mel Asn Ala Leu Val Arg Val Ser Gly Val Leu Thr Asn 

100 105 no 

GCT CCT TTT ATG TTG AAC TTG GAT TGT GAC CAC TAT TTA AAT AAC AGC 
Ala Pro Phe Mel Leu Asn Leu Asp Cys Asp His Tyr Leu Asn Asn Ser 

*I5 120 125 

AAG GCT GTA AGA GAG GCT ATG TGT TTC TTG ATG GAC CCT CAA ATT GGA 432 
Lys Ala Val Arg Glu Ala Met Cys Pbe Leu Met Asp Pro Gin He Gly 

130 135 uo 

AGA AAG GTT TGC TAT GTC CAA TTC CCT CAA CGT TTC GAT GGT ATT GAT 480 
Arg Lys Val Cys Tyr Val Gin Phe Pro Gin Arg Phe Asp Gly He Asp 
145 150 155 160 

AGA CAT GAT CGA TAT GCC AAT CGG AAC ACA GTT TTC TTT GAT ATT AAC 528 
Arg His Asp Arg Tyr Ala Asn Arg Asn Thr Val Phe Phe Asp Me Asn 

165 170 175 

ATG AAA GGT CTA GAT GGT ATA CAA GGC CCT GTA TAT GTC GGC ACG GGG 576 
Met Lys Gly Leu Asp Gly He Gin Gly Pro Val Tyr Vai Gly Thr Gly 

1 80 185 190 

TGT GTT TTC AGA AGG CAA GCT CTT TAT GGT TAT GAA CCT CCA AAG GGA 624 
Cys Val Phe Arg Arg Gin Ala Leu Tyr Gly Tyr Glu Pro Pro Lys Gly 

195 200 205 

CCT AAG CGC CCG AAA ATG GTA ACC TGT GGT TGC TGC CCT TGC TTT GGA 672 
Pro Lys Arg Pro Lys Met Val Thr Cys Gly Cys Cys Pro Cys Phe Gly 

210 215 220 

CGC CGC AGA AAG GAC AAA AAG CAC TCT AAG GAT GGT GGA AAT GCA AAT 720 
Arg Arg Arg Lys Asp Lys Lys His Ser Lys Asp Gly Gly Asn Ala Asn 
225 230 235 240 

GGT CTA AGC CTA GAA GCA GCC GAA GAT GAC AAG GAG TTA TTG ATG TCC 768 
Gly Leu Ser Leu Glu Ala Ala Glu Asp Asp Lys Glu Leu Leu Mel Ser 

245 250 255 

CAC ATG AAC TTT GAA AAG AAA TTT GGA CAA TCA GCC ATT TTT GTA ACT 816 
His Met Asn Phe Glu Lys Lys Phe Gly Gin Ser Ala He Phe Val Thr 

260 265 270 

TCA ACA CTG ATG GAA CAA GGT GGT GTC CCT CCT TCT TCA AGC CCT GCA 864 
Ser Thr Leu Met Glu Gin Gly Gly Val Pro Pro Ser Ser Ser Pro Ala 

275 280 285 

GCT TTG CTC AAA GAA GCC ATT CAT GTA ATT AGT TGT GGT TAT GAA GAC 912 
Ala Leu Leu Lys Glu Ala He His Val He Ser Cys Gly Tyr Glu Asp 

290 295 300 

AAA ACC GAA TGG GGA AGC GAG CTT GGC TGG ATT TAC GGC TCG ATT ACA 
Lys Thr Glu Trp Gly Ser Glu Leu Gly Trp He Tyr Gly Ser He Thr 
305 3 '0 315 320 

GAA GAT ATC TTA ACA GGT TTC AAG ATG CAT TGC CGT GGA T 
Glu Asp He Leu Thr Gly Phe Lys Net His Cys Arg Gly 



960 



1000 
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59 60 
325 330 
[0 0 8 3] ffi*»J#*§ : 10 - h#nv-:i!Jgtt 

SB^iCDfi^ : 333 MF\<D$k$& : l"?*? H 

&&l(Dm:7$ym 7 7#* >bM : internal 

Asp Lys Val Arg Pro Thr Phe Val Lys Glo Arg Arg Ala Met Lys Arg 

1 5 10 15 

Glo Tyr Glo Glo Phe Lys Vai Arg lie Asn Ala Leo Val Ala Lys Ala 

20 25 30 

Gin Lys Val Pro Pro Glo Gly Trp lie Met Gin Asp Gly Thr Pro Trp 

35 40 45' 

Pro Gly Asn Asn Thr Lys Asp His Pro Gly Met lie Gin Val Phe Leu 

50 55 60 

Gly Gin Ser Gly Gly His Asp Thr Glu Gly Asn Glo Leo Pro Arg Len 
65 70 75 80 

Val Tyr Val Ser Arg Gin Lys Arg Pro Gly Phe Len His His Lys Lys 

85 90 95 

Ala Gly Ala Met Asn Ala Len Val Arg Val Ser Gly Val Len Thr Asn 

100 105 110 

Ala Pro Phe Met Len Asn Leu Asp Cys Asp His Tyr Leu Asn Asn Ser 

115 120 125 

Lys Ala Val Arg Gin Ala Met Cys Phe Leu Met Asp Pro Gin lie Gly 

130 135 140 

Arg Lys Val Cys Tyr Val Gin Phe Pro Gin Arg Phe Asp Gly He Asp 
145 150 155 160 

Arg His Asp Arg Tyr Ala Asn Arg Asn Thr Val Phe Phe Asp He Asn 

165 170 175 

Met Lys Gly Leu Asp Gly lie Gin Gly Pro Val Tyr Val Gly Thr Gly 

180 185 190 

Cys Val Phe Arg Arg Gin Ala Leu Tyr Gly Tyr Glu Pro Pro Lys Gly 

195 200 205 

Pro Lys Arg Pro Lys Met Val Thr Cys Gly Cys Cys Pro Cys Phe Gly 

210 215 220 

Arg Arg Arg Lys Asp Lys Lys His Ser Lys Asp Gly Gly Asn Ala Asn 
225 230 235 240 

Gly Leu Ser Leu Glu Ala Ala Glu Asp Asp Lys Glu Leu Leu Met Ser 

245 250 255 

His Met Asn Phe Glu Lys Lys Phe Gly Gin Ser Ala lie Phe Val Thr 

260 265 270 

Ser Thr Leu Met Glu Gin Gly Gly Val Pro Pro Ser Ser Ser Pro Ala 

275 280 285 

Ala Leu Leu Lys Glu Ala He His Val lie Ser Cys Gly Tyr Glu Asp 

290 295 300 

Lys Thr Glu Trp Gly Ser Glu Leu Gly Trp He Tyr Gly Ser He Thr 
305 310 ' 315 320 

Glu Asp He Leu Thr Gly Phe Lys Met His Cys Arg Gly 

325 330 
[0 0 8 4] £«t:l! E^JcDS^ : H 

fi^J©§$ : 333 7 7#* > hi! : C-teninal 



t 
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61 



62 



mm 



m<Ofe *S : XaateGluXteLysfc^T 



Asp Lys Val Arg Pro Thr Phe Val Lys Glu Arg Arg Ala Mel Lys Arg 

15 10 15 

Glu Tyr Glu Glu Phe Lys Val Arg lie Asn Ala Leu Val Ala Lys Ala 

20 25 30 

Gin Lys Val Pro Pro Glu Gly Trp lie Met Gin Asp Gly Thr Pro Trp 

35 40 45 

Pro Gly Asn Asn Thr Lys Asp His Pro Gly Met He Gin Val Pbe Leu 

50 55 60 

Gly Gin Ser Gly Gly His Asp Thr Glu Gly Asn Glu Leu Pro Arg Leu 
65 70 75 80 

Val Tyr Val Ser Arg Glu Lys Arg Pro Gly Phe Leu His His Lys Lys 

85 90 95 

Ala Gly Ala Met Asn Ala Leu Val Arg Val Ser Gly Val Leu Thr Asn 

100 105 110 

Ala Pro Phe Met Leu Asn Leu Asp Cys Asp His Tyr Leu Asn Asn Ser 

115 120 125 

Lys Ala Val Arg Glu Ala Met Cys Phe Leu Met Asp Pro Gin lie Gly 

130 135 140 

Arg Lys Val Cys Tyr Val Gin Phe Pro Gin Arg Phe Asp Gly He Asp 
145 150 155 160 

Arg His Asp Arg Tyr Ala Asn Arg Asn Thr Val Phe Phe Asp lie Asn 

165 170 175 

Met Lys Gly Leu Asp Gly lie Gin Gly Pro Val Tyr Val Gly Thr Gly 

180 185 190 

Cys Val Phe Arg Arg Gin Ala Leu Tyr Gly Tyr Glu Pro Pro Lys Gly • 

195 200 205 

Pro Lys Arg Pro Lys Met Val Thr Cys Gly Cys Cys Pro Cys Phe Gly 

210 215 220 

Arg Arg Arg Lys Asp Lys Lys His Ser Lys Asp Gly Gly Asn Ala Asn 
225 230 235 240 

Gly Leu Ser Leu Glu Ala Ala Xaa Asp Asp Lys Gin Leu Leu Met Ser 

245 250 255 

His Met Asn Phe Glu Lys Lys Phe Gly Gin Ser Ala He Phe Val Thr 

260 265 270 

Ser Thr Leu Met Glu Gin Gly Gly Val Pro Pro Ser Ser Ser Pro Ala 

275 280 285 

Ala Leu Leu Lys Glu Ala Me His Val He Ser Cys Gly Tyr Glu Asp 

290 295 300 

Lys Thr Glu Trp Gly Ser Glu Leu Gly Trp He Tyr Gly Ser He Thr 
305 310 315 320 

Glu Asp He Leu Thr Gly Phe Lys Met His Cys Arg Gly Trp Arg Ser 

325 330 335 

He Tyr Cys Met Pro Lys Leu Pro Ala Phe Lys Gly Ser Ala Pro He 

340 345 350 

Asn Leu Ser Asp Arg Leu Asn Gin Val Leu Arg Trp Ala Leu Gly Ser 

355 360 365 

Val Glu He Phe Phe Ser His His Cys Pro Ala Trp Tyr Gly Phe Lys 



( 33 ) ftfflW- 1 0 - 2 7 6 7 8 2 

63 64 
370 375 380 

G I y Gly Lys Leo Lys Trp Lea Gin Arg-Phe Ala Tyr Val Asa Thr Thr 
385 390 395 400 

He Tyr Pro Phe Thr Ser Leu Pro Leo Lea Ala Tyr Cys Thr Lea Pro 

405 410 415 

Ala lie Cys Leu Leu Thr Asp Lys Phe lie Met Pro Pro lie Ser Thr 

420 • 425 430 

Phe Ala Ser Leu Phe Phe He Ala Leu Phe Leu Ser lie Phe Ala Thr 

435 440 445 

Gly lie Leu Glu Leu Arg Trp Ser Gly Val Ser lie Glu Gin Trp Trp 

450 455 460 

Arg Asn Glu Gin Phe Trp Val lie Gly Gly lie Ser Ala His Leu Phe 
465 470 475 480 

Ala Val He Gin Gly Leu Leu Lys Val Leu Ala Gly He Asp Thr Asn 

485 490 495 

Phe Thr Val Thr Ser Lys Ala Thr Asp Asp Glu Glu Phe Gly Glu Leu 

500 505 510 

Tyr Thr Phe Lys Trp Thr Thr Leu Leu He Pro Pro Thr Thr Val Leu 

515 520 525 

He He Asn Leu Val Gly Val Val Ala Gly He Ser Asp Ala He Asn 

530 535 540 

Asn Gly Tyr Gin Ser Trp Gly Pro Leu Phe. Gly Lys Leu Phe Phe Ser 
545 550 555 560 

Phe Trp Val He Val His Leo Tyr Pro Phe Leu Lys Gly Leu Met Gly 

565 570 575 

Arg Gin Asn Arg Thr Pro Thr He Val Val He Trp Ser Val Leu Leu 

580 585 590 

Ala Ser He Phe Ser Leu Leu Trp Val Arg He Asp Pro Phe Val Met 

595 600 605 

Lys Thr Lys Gly Pro Asp Thr Thr Met Cys Gly He Asn Cys 
610 615 620 

[0 0 8 5] m&m^ : \z 1 5 

iB^jog^ : 9 [0086] mnm^ : 13 

&p\<dm : r ^ / m ge^jtogs : 54 

as;- : mmvt &p\<dm : &m 

7^tf* > : internal h4?Py- : 

Gin Xaa Xaa Xaa Xaa Xaa Xaa Arg Trp 

GAGAGAGAGA GAGAGAGAGA ACTAGTCTCG AGTTTTTTTT TTTTTTTTTT 54 
[0 0 8 7] £^lf^:14 S2JU<D#& 

mn(D&ts : 13 ftrnzmrtm: 

&&\<DM : mm 4 

AATTCGGCAC GAG 13 
[0 0 8 81 EfH#9 : 15 50 lE^JOgS : 20 



65 

mm<OM : mm 

m<o& : 

mm 

GACTGAAGAT AAGCCAAAAG 
C 0 0 8 9 ] SS^J#^ : 16 
mm<D&2 : 19 

mmom : mm 

mm 

GGAATGATGA ATTTGCCGG 
CO 0 9 0] gg^J#^ : 17 

mm<D&$ : 20 

mm<om : mm 

mm 

TGCAGGCAAC TTTGGCATGC 

coo9i] mmmn : i8 

S^J©1$ : 30 

i2^j(D^ : mm 

mm 

AGCAACACGA GCAAGATGAG GAGGATGACT 

c o o 9 2 ] mmm^ : i9 

SS CO ^ ^ : 28 

mrnoM : mm 

mm 

CCGGATCCTT CAACCCTTCT TCGATTTC 

[0093] mm&n 20 

mm<D&z : 29 

mm<om : mm 

mm 

CCGGATCCAC GGCAATGCAT CTTGAAACC 

[0094] mmmn : 21 
mm<D&$ : 17 

mm<DM : mm 

mm 

GGTTAGCATA TTGTTTGTAG CATTGGG 

[0095] mmmn : 22 

mmo&z : 17 

mmcoM : mm 

mm 

ATCAATGAAA TATGTATAGT TCATAGC 

[0096] mmm^ : 23 
mm<D&z : 17 

mm&m : 

CTTTCGTTCT TTTGGTTTTG CCATGGC 

[0097] mm&n za 
mmn&ti : 17 

mm&m : 

AGACTTTTTA CAAACAAGAT AAATCCC 

[0®COfB!i£&tfcipn 
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66 

20 

SB^JOii : Wi..^^DNA 

20 
30 
28 
29 

30 

17 
17 

40 flcogfc : 

17 

mmomm : ms.^^dna 

17 

50 mi] *%i(DDNAO-WT^PcsA3(D 



67 

[0 2] e c o r i r y'7*-<Dmm&7f;'?m. 

[0 3] PcsA3 -6 8 2 tPcsA3- , 3 l OgB 
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68 

- [04] P c s A 3 - 6 8 2 £ P c s A 3 - 3 ' (Dffi 



im i ) 



im 2 ] 



Pc»A3 



PciA3-5* 



PdoA3-682 



[04] 



5' AATTCGGCACGAQ 3* 
3* GCGQTQCTC S' 



550 560 570 580 590 600 

PceA3-682 OQ0CCT6TATATGTGGGCADGG6GT6TGTTTTCABAA6GCAAGCTCTTTAT66TTATQAA 



• * • • 



Pc©A3-3 1 GGOCCT GTAT AT GTC GGGACGGGGTGTGT TTTCAG AAGGCAAGCTCTTTATGGTT ATGAA 

560 570 580 590 600 610 

610 620 630 640 650 660 

PcsA3-682 CCTCCAAAQGGACCTAA0C0GCCGAAAATGGTAACCTGT6GTTGCTGCCCTTGTTTTGGA 



• a • m m 
m . • • • 



jfc « » • • • 
.«•..» ~ • • » • * - 



PcsA3~3' 



OCTCCAAAfiGGACGTAAGCQOCCGAAAATGGTAAOCTGTGGTTGCTGOOCTTGCTTTGGA 
620 630 640 650 660 670 

670 680 690 700 710 720 

PosA3-682 CGOCGCAGAAAGGACAAAAAGCACTCIAAGGATGGTGGAAATGCAAATGGTCTAAGGCTA 



• * 



Po3A3-3 f 



CQ0GGCAGAAAGGACAAAAAGCACTC1AAGGATQGTGGAAATOGAAATGGTCTAAGCCTA 
6B0 690 700 710 720 730 

730 740 750 760 770 780 

PcsA3-682 GAAGC AGCC AAAG AT GACAAGGAGT T ATTGATGTCCCAOATGAAOTTTQAAAAG AAATTT 



♦ • •> » 4| ' 
> * * ■ • 



. . * • » * * • 



« » • • 



Pes A3-3' GAA0CAGCCGAAGAT GAGAAG 3AGTT AT T GATG r CCCACATGAACTT7GAAAAG AAATTT 

740 750 760 770 780 790 

790 800 810 820 830 840 

PcaA3-682 GGADAATCAGCC ATTTTTGTAACT TCAW5ACT6ATGGAACAAGG TGGTGTGGGTCCTTCT 

Pc»A3-3' GGACAATCAGCC ATT TT TGT AACT TCAAC ACTGAT GGAACAAGGTGGTGTCCCTCCTTCT 

800 810 620 830 840 850 

850 860 870 880 890 900 

PcsA3-682 TCAAGCCOGGGAGCTtTGCTCAAAGAAGCCATTOATGTAATTAGTTOTQGTTATGAAGAC 

........a.. ............... »■» •■»••«.»•»•••••••••••••«• 

....... ....»«•...•••••••--•• .•».»...•••»---•-------•- 

PcsA3-3* TGAA0GGCT6CAGGT TTGCTCAAAGAAGCC ATTCATGT AAT T AO TTQTQGTT ATGAAGAC 

860 870 880 890 900 910 

910 920 930 940 950 960 

PC8A3-682 AAAACAGMTGGGGAAGCGAGCTTGGCTGGATTTACOGCTCGATTACAQAAQATATCTTA 

..... a • •» ---■--•---•»-••*■•*•***•■ 

» . - . . *** ....... ...... ,«..»..••••-».--••- 

PcsA3-3' AAAACCGAATGGGGAAGCGAGCTTGGCTGGATTTACGGCTCGATTACAQAAGATATCTTA 

920 930 940 950 960 970 

970 880 
PcftA3-882 ACAOQATTCAAGATGCATTGCCGTGGAT 



PosA3-3* ACAGGTTTCAAGATGCATTOOOGTGQAT 

980 990 1000 
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[0 3] 

10 20 30 40 50 60 

PcsA3-682 OCQACATTCSTQAAOQAGCQTCGAGCTATGAAGAGAGAATATOAAOAATTCAAQOTTAQQ 

*•--••»••...._. 



• ** * • ...... 

PcsA3~3* OCGACATTCGT6MGGA6CGTCGAGCTATGMGAGAGMTATGAAQMTTCAAGOTTAQ0 

20 30 40 50 60 70 

70 «> 90 100 110 120 

PcbA3~682 ATAAATQOA0TTQTAOCCAAAGCCCAAAAGG1 TCCTCC AOAAOOOTQGATCATGCAACAT 

. . I .. * I 

•••••••• • ........ . . . 

PcsA3-3 ' ATAMTGCACTTGTAQCCAAAGCOCAAMGGITC^TCCAGAAGGOTGGATCATGCAAGAT 

80 90 100 110 120 130 

130 140 150 160 170 180 

PcsA3-682 GGGAGAGCAT6GGCAGGAAACAATACT AAAGATGAOOC TGGT ATGATTCAAGT ATT TCTO 

Pc*A3-3' GGQACADCArG6MX^(K3AAACAATACTAA^ 

140 150 160 170 180 190 

190 200 210 220 230 240 

PosA3-682 GGTCAAA0T6 QAGGCC AT QATACC G AAG G AAATGAQCT TOO TCGTCTCGTC T AT GT ATCT 

• ♦ :::::::: 

PcsA3-3' GGTGAMGTGOAGGOCATGATAOCGAAGGAAATGACXSTTCCTCGTCTOGTCTATGTATCT 

200 210 220 230 240 250 

250 260 270 280 200 300 

Pc*A3-682 CGAGAGAAAAGGCCTGGTTTCTTGCATCACAAGAAAQCrGGTGGGATQAACGCCCTTGTT 

: : : : : : : : ■* : : : : : : : : : : : : : : - • :::::::::::::::::::::::::: : 

PosA3-3* CGAGAGAAAAGGCCAGGTTTCTTGCATCACAAGAAADGTGGTGCCATGAACGCCCTTGTT 

260 270 280 290 300 310 

310 320 330 340 350 360 

PcsA3-682 CGGQTCTCG6GGGTGCTCAGAAATGCTCCTTTTATGTTGAACTT0GATTGTGACCATTAT 

* • jfc »•»•»••»«• ■ . . ■ ^ . ...... 

■ • 

PcsA3-3' CGTGTCTCGBGGGTGCTTACAAATGCTCCTnTATGnG^TTGGATTGTGM 

320 330 340 350 360 370 

370 380 390 400 410 420 

Po*A3-682 TTAAATAACAGCAAGGCTGTAAGAGAGGCTATGTGTTTCrTGATGGACOGTCAAATTGGA 

PobA3-3' TTAAATAACAGCAAGGCTGTAAGAGA6GCTATGTG7TTCTTQATGGAG0CTCAMTTGGA 

380 390 400 410 420 430 

430 440 450 460 470 480 

PcsA3-682 AGAAAGGTTTGCTATGTOCAATTCCCTCAACGTTTOGATGGTATTGATAGACATGATCGA 



Pe&A3-3* AGAAAGGT T TGC T ATGTCCAATTCCCTCAACGTT TOGATGGTAT TGATAQAOAT GATCG A 

440 450 460 470 480 490 

490 500 510 520 530 540 

PcsA3-B82 7ATGCCAAT0GGAACACA0TTTTCTTTGATATTAACATGAAAGGTCTAGATGGTATACAA 



PbsA3-3 ' TATGCCAATOGGAAOACAGTTTTCTTTGATATT AACATGAAAGGTCTAGATGGTATAC AA 

500 510 520 530 540 550 



C5i> int.ci. * mwmn nnmrnmn 



F I 
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C12N 5/00 d 
//(C12N 15/09 ZNA 

CI2R 1:91 ) 



( 37 ) 
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